HPC Engineer/Data Storage Engineer
📍 Job Overview
- Job Title: HPC Engineer/Data Storage Engineer
- Company: Sabre Systems Inc.
- Location: Arlington, Virginia
- Job Type: Full-Time
- Category: Server Administrator/Web Infrastructure
- Date Posted: 2025-02-10
- Experience Level: Mid/Senior
- Remote Status: On-site
🚀 Role Summary
- 📝 Enhancement Note: This role focuses on managing and optimizing high-performance computing (HPC) data storage systems, with a strong emphasis on the Lustre filesystem and Linux system administration. It requires a blend of technical expertise in data storage, scripting, and networking, along with experience in a production computing environment.
💻 Primary Responsibilities
- 📝 Enhancement Note: The primary responsibilities revolve around managing and optimizing HPC data storage systems, with a focus on capacity planning, performance management, and hardware management.
- 💡 Manage and optimize HPC data storage systems at the local site and assist remote locations as needed, ensuring high availability and performance.
- 💡 Perform capacity planning and management to anticipate and mitigate storage capacity issues, balancing current and future needs.
- 💡 Monitor and manage storage system performance, identifying bottlenecks and implementing optimizations to improve overall system performance.
- 💡 Develop and maintain metrics reporting to track storage system performance, capacity usage, and other relevant metrics.
- 💡 Enforce storage quotas and policies to ensure efficient use of storage resources and maintain data security.
- 💡 Manage and tune storage system hardware to ensure optimal performance and reliability, including SAN infrastructure.
- 💡 Collaborate with the team to ensure consistent storage management practices and contribute to overall system architecture decisions.
🎓 Skills & Qualifications
Education: A Bachelor's degree in Computer Science, Information Technology, or a related field is typically required. Relevant coursework or equivalent experience may be considered.
Experience: At least 3 years of related experience in managing and administering Linux-based HPC data storage systems is required. Experience in a production computing environment is highly desirable.
Required Skills:
- 💡 Proven expertise in managing and administering the Lustre filesystem in a production environment.
- 💡 Extensive knowledge of building, configuring, and administering production Linux computer/storage systems.
- 💡 Proficiency in scripting with Bash, Perl, Python, or similar languages to automate tasks and improve efficiency.
- 💡 Broad knowledge of data storage administration, including capacity planning, performance management, and hardware management.
- 💡 Familiarity with storage system hardware and relevant protocols, such as SAN infrastructure.
- 💡 Working knowledge of networking concepts and practices, including relevant protocols and technologies.
- 💡 Knowledge of or experience with hardware and software security practices to ensure data protection and system integrity.
Preferred Skills:
- 💡 Demonstrated experience building, configuring, and administering Lustre in a production Linux HPC environment.
- 💡 Knowledge of file systems such as ZFS, EXT, XFS, and experience working with relational database systems.
- 💡 Experience with object storage and RESTful storage interfaces, as well as distributed file systems and advanced file system concepts.
- 💡 Familiarity with relevant storage system protocols and hardware, including NVMe-based storage devices and computational storage devices.
- 💡 Experience with Ansible and Git for infrastructure automation and version control.
- 💡 Experience diagnosing system software problems and supporting a scientific user base.
- 💡 Experience with multiple network technologies (e.g., Ethernet, IB, OPA) and low-level system administration tools (e.g., perf, strace, tcpdump, vmstat).
- 💡 Experience managing computers in a DOD classified environment.
📊 Web Portfolio & Project Requirements
Portfolio Essentials:
- 💡 Include detailed case studies demonstrating your experience managing and optimizing HPC data storage systems, highlighting the challenges faced and the solutions implemented.
- 💡 Showcase your scripting skills by providing examples of scripts you've written to automate tasks and improve storage system efficiency.
- 💡 Provide evidence of your experience with Lustre and other relevant file systems, including any certifications or training you've completed.
- 💡 Include any relevant metrics and performance reports you've developed to track storage system performance and capacity usage.
Technical Documentation:
- 💡 Prepare a detailed document outlining your approach to HPC data storage management, including your strategies for capacity planning, performance management, and hardware management.
- 💡 Include any relevant diagrams or visual aids to illustrate your storage system architecture and explain your decision-making processes.
- 💡 Provide examples of your problem-solving skills by describing how you've diagnosed and resolved storage system issues in the past.
💵 Compensation & Benefits
Salary Range: The general salary range for this position is $60,000 - $150,000 per year, with the exact compensation determined by individual qualifications and applicable contract terms.
Benefits:
- 💡 Comprehensive, evolving benefits package designed to meet the diverse needs of employees, including health, dental, and vision insurance, retirement plans, and paid time off.
- 💡 Employee-focused culture that values open communication, continuous learning, and professional growth at all levels.
- 💡 Competitive salary and bonus structures that recognize and reward employee contributions and achievements.
- 💡 Opportunities for career advancement within the company, as Sabre Systems values internal promotions and encourages employees to develop their skills and take on new challenges.
Working Hours: Full-time employees typically work 40 hours per week, with flexible scheduling options available to accommodate individual needs and project requirements.
📝 Enhancement Note: While the salary range provided is a general guideline, the final compensation for this role may vary based on individual qualifications, experience, and applicable contract terms. Research has shown that the average salary for an HPC Engineer/Data Storage Engineer in the Arlington, Virginia area is approximately $90,000 - $130,000 per year, with experienced professionals earning up to $160,000 or more.
🎯 Team & Company Context
🏢 Company Culture
Industry: Sabre Systems, LLC operates in the defense and government services sectors, providing innovative technological solutions and services for Department of Defense, Federal Civilian, and commercial customers. This role is focused on supporting the ever-evolving areas of advanced communication technologies, cyber, systems and software engineering, and digital transformation.
Company Size: As a mid-sized company with over 35 years in business, Sabre Systems maintains small business values while offering the resources and opportunities of a larger organization. This size allows for a more personal and collaborative work environment, with a strong emphasis on employee growth and development.
Founded: Sabre Systems was founded in 1987 and has since grown to become a trusted provider of technology solutions and services to various government and commercial clients.
Team Structure:
- 💡 The HPC Data Storage Engineer will work closely with the HPC team, collaborating on storage management strategies, capacity planning, and performance optimization.
- 💡 The role will also interact with other technical teams, such as systems and software engineering, to ensure consistent storage management practices and contribute to overall system architecture decisions.
- 💡 Sabre Systems fosters a cross-functional and collaborative work environment, encouraging open communication and knowledge sharing between teams.
Development Methodology:
- 💡 Sabre Systems follows Agile development methodologies, with a focus on iterative development, continuous improvement, and customer satisfaction.
- 💡 The company emphasizes code reviews, testing, and quality assurance practices to ensure the delivery of high-quality, reliable, and secure technology solutions.
- 💡 Sabre Systems employs CI/CD pipelines and automated deployment strategies to streamline the software development lifecycle and improve deployment efficiency.
Company Website: Sabre Systems, LLC
📝 Enhancement Note: Sabre Systems, LLC values diversity, inclusion, and equal opportunity, actively seeking to create an inclusive work environment that reflects the diverse perspectives and backgrounds of its employees and customers. The company is committed to providing a safe and respectful workplace for all employees, free from discrimination and harassment.
📈 Career & Growth Analysis
Web Technology Career Level: This role is at the mid/senior level, requiring a high degree of technical expertise and experience in managing and optimizing HPC data storage systems. The ideal candidate will have a strong background in Linux system administration, data storage administration, and scripting, with a focus on the Lustre filesystem.
Reporting Structure: The HPC Data Storage Engineer will report directly to the HPC Team Lead or a similar role within the organization, depending on the specific team structure. The role may also interact with other technical team leads and management to ensure consistent storage management practices and contribute to overall system architecture decisions.
Technical Impact: The HPC Data Storage Engineer will have a significant impact on the performance, availability, and security of the HPC data storage systems, directly contributing to the success of the organization's projects and clients. The role will also influence the development and implementation of storage management strategies, capacity planning, and performance optimization.
Growth Opportunities:
- 💡 Technical specialization in HPC data storage management, with opportunities to develop expertise in emerging technologies and advanced file system concepts.
- 💡 Technical leadership roles, such as Senior HPC Data Storage Engineer or HPC Team Lead, with responsibilities for mentoring junior team members, driving technical decisions, and contributing to overall system architecture.
- 💡 Career progression into related roles, such as Senior Systems Engineer, Technical Lead, or other senior technical positions, based on individual skills, interests, and career goals.
📝 Enhancement Note: Sabre Systems, LLC offers a supportive and collaborative work environment that encourages employee growth and development. The company provides opportunities for employees to take on new challenges, expand their skill sets, and advance their careers within the organization.
🌐 Work Environment
Office Type: Sabre Systems, LLC operates a hybrid work environment, with employees working both on-site and remotely, depending on the specific role and project requirements. The HPC Data Storage Engineer role requires on-site presence in Arlington, Virginia, to ensure close collaboration with the HPC team and access to the relevant hardware and infrastructure.
Office Location(s): Sabre Systems, LLC has offices in Arlington, Virginia, and other locations across the United States. The HPC Data Storage Engineer role is based in Arlington, Virginia, with opportunities for remote work on a case-by-case basis.
Workspace Context:
- 💡 The HPC Data Storage Engineer will work in a collaborative, open-plan office environment, sharing space with other members of the HPC team and other technical teams.
- 💡 The role will have access to relevant hardware, software, and infrastructure, including high-performance computing systems, storage systems, and networking equipment.
- 💡 Sabre Systems, LLC provides employees with the tools and resources they need to succeed, including modern workstations, multiple monitors, and testing devices, as well as access to relevant software and applications.
Work Schedule: The HPC Data Storage Engineer role follows a standard full-time work schedule, with flexible scheduling options available to accommodate individual needs and project requirements. The role may require occasional overtime or on-call responsibilities to ensure the availability and performance of the HPC data storage systems.
📝 Enhancement Note: Sabre Systems, LLC values work-life balance and offers flexible scheduling options to support the well-being and productivity of its employees. The company encourages employees to prioritize their personal and professional development and provides opportunities for growth and advancement within the organization.
📄 Application & Technical Interview Process
Interview Process:
- 💡 Initial screening: A brief phone or video call to assess the candidate's qualifications, experience, and cultural fit.
- 💡 Technical assessment: A hands-on, in-depth technical interview focusing on the candidate's knowledge of HPC data storage management, Linux system administration, and scripting. The interview may include questions on capacity planning, performance management, and hardware management, as well as problem-solving scenarios and coding challenges.
- 💡 Final round: An in-depth discussion with the HPC Team Lead or a similar role within the organization, focusing on the candidate's technical expertise, cultural fit, and career goals.
Portfolio Review Tips:
- 💡 Highlight your experience with HPC data storage management and provide specific examples of your achievements in capacity planning, performance management, and hardware management.
- 💡 Include any relevant metrics and performance reports you've developed to demonstrate your ability to track and optimize storage system performance.
- 💡 Showcase your scripting skills by providing examples of scripts you've written to automate tasks and improve storage system efficiency.
- 💡 Tailor your portfolio to the specific requirements of the role, emphasizing your experience with the Lustre filesystem, Linux system administration, and data storage administration.
Technical Challenge Preparation:
- 💡 Brush up on your knowledge of HPC data storage management, focusing on capacity planning, performance management, and hardware management.
- 💡 Review your experience with the Lustre filesystem and Linux system administration, ensuring you're familiar with the relevant concepts and best practices.
- 💡 Practice problem-solving scenarios and coding challenges related to HPC data storage management, focusing on your ability to diagnose and resolve storage system issues.
- 💡 Familiarize yourself with the Sabre Systems, LLC company culture and values, ensuring you can articulate how your skills and experiences align with the organization's mission and goals.
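For the practice suggested above, one possible warm-up exercise is a capacity projection. The following is a minimal Python sketch, not something taken from Sabre Systems' interview process: the function name `days_until_full`, the sample figures, and the 200 TB capacity are all hypothetical. It fits a least-squares line through dated usage samples and estimates how many days remain before the filesystem fills.

```python
from datetime import date


def days_until_full(samples, capacity_tb):
    """Estimate days until capacity is reached from (date, used_tb) samples.

    Fits a least-squares line through the samples. Returns None when usage
    is flat or shrinking, since no fill date can be projected.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")

    # Convert dates to day offsets from the first sample.
    first_day = samples[0][0]
    xs = [(day - first_day).days for day, _ in samples]
    ys = [used for _, used in samples]

    n = len(samples)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    if var_x == 0:
        raise ValueError("samples must span more than one day")

    # Growth rate in TB per day.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / var_x
    if slope <= 0:
        return None

    # Fitted usage at the most recent sample, then project forward.
    intercept = mean_y - slope * mean_x
    current = intercept + slope * xs[-1]
    return max(0.0, (capacity_tb - current) / slope)


if __name__ == "__main__":
    # Hypothetical usage history for illustration only.
    history = [
        (date(2025, 1, 1), 100.0),
        (date(2025, 1, 11), 110.0),
        (date(2025, 1, 21), 120.0),
    ]
    print(days_until_full(history, capacity_tb=200.0))
```

In a real environment the samples would come from whatever usage reporting the site already collects, and a straight-line fit is only a first approximation; growth on shared HPC storage is often bursty.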
📝 Enhancement Note: Sabre Systems, LLC uses an Applicant Tracking System (ATS) to manage job applications and candidate screening. Including relevant keywords in your resume and portfolio can help ensure that your application is properly matched with the HPC Data Storage Engineer role and increase your chances of being selected for an interview.
🛠 Technology Stack & Web Infrastructure
Backend & Server Technologies:
- 💡 Lustre filesystem administration: Proven expertise in managing and optimizing the Lustre filesystem in a production environment.
- 💡 Linux system administration: Extensive knowledge of building, configuring, and administering production Linux computer/storage systems.
- 💡 Scripting: Proficiency in scripting with Bash, Perl, Python, or similar languages to automate tasks and improve efficiency.
- 💡 Storage system hardware: Familiarity with storage system hardware and relevant protocols, such as SAN infrastructure.
- 💡 Networking: Working knowledge of networking concepts and practices, including relevant protocols and technologies.
- 💡 Relational database systems: Experience working with relational database systems and relevant management tools.
Development & DevOps Tools:
- 💡 Ansible: Familiarity with Ansible for infrastructure automation and deployment.
- 💡 Git: Experience with Git for version control and collaborative development.
- 💡 Low-level system administration tools: Proficiency in using tools such as perf, strace, tcpdump, and vmstat to diagnose system software problems and optimize performance.
📝 Enhancement Note: The technology stack for this role is focused on HPC data storage management, Linux system administration, and scripting. Familiarity with the relevant tools, technologies, and best practices is essential for success in this role.
👥 Team Culture & Values
Web Development Values:
- 💡 Technical excellence: A commitment to delivering high-quality, reliable, and secure technology solutions, with a focus on continuous learning and improvement.
- 💡 Performance optimization: A dedication to optimizing the performance of HPC data storage systems, with a focus on capacity planning, performance management, and hardware management.
- 💡 User-focused design: An understanding of the needs and priorities of the organization's clients and users, with a commitment to delivering solutions that meet their specific requirements.
- 💡 Collaboration and knowledge sharing: A commitment to working collaboratively with other technical teams and sharing knowledge and expertise to improve the overall performance and security of the organization's technology infrastructure.
Collaboration Style:
- 💡 Cross-functional integration: Sabre Systems, LLC fosters a collaborative and cross-functional work environment, encouraging open communication and knowledge sharing between teams.
- 💡 Code review culture: The company emphasizes code reviews, testing, and quality assurance practices to ensure the delivery of high-quality, reliable, and secure technology solutions.
- 💡 Knowledge sharing and mentoring: Sabre Systems, LLC encourages employees to share their knowledge and expertise with their colleagues, fostering a culture of continuous learning and professional development.
⚡ Challenges & Growth Opportunities
Technical Challenges:
- 💡 Managing and optimizing HPC data storage systems in a dynamic and evolving environment, with a focus on capacity planning, performance management, and hardware management.
- 💡 Troubleshooting and resolving storage system issues in a timely and efficient manner, with a focus on minimizing downtime and ensuring high availability.
- 💡 Collaborating with other technical teams to ensure consistent storage management practices and contribute to overall system architecture decisions.
- 💡 Staying up-to-date with emerging technologies and best practices in HPC data storage management, with a focus on continuous learning and professional development.
Learning & Development Opportunities:
- 💡 Opportunities for professional development and training, including conference attendance, certification, and community involvement, to support the continuous learning and growth of employees.
💡 Interview Preparation
Technical Questions:
- 💡 HPC data storage management: Questions focusing on capacity planning, performance management, and hardware management, as well as problem-solving scenarios and coding challenges related to HPC data storage systems.
- 💡 Linux system administration: Questions assessing the candidate's knowledge of Linux system administration, including system configuration, user management, and security best practices.
- 💡 Scripting: Questions evaluating the candidate's proficiency in scripting with Bash, Perl, Python, or similar languages, with a focus on automation, efficiency, and problem-solving.
- 💡 Storage system hardware: Questions exploring the candidate's familiarity with storage system hardware and relevant protocols, such as SAN infrastructure, with a focus on hardware management and optimization.
Company & Culture Questions:
- 💡 Company culture and values: Questions designed to assess the candidate's understanding of Sabre Systems, LLC's company culture, values, and mission, as well as their ability to contribute to a positive and inclusive work environment.
- 💡 Team dynamics and collaboration: Questions focusing on the candidate's experience working in a collaborative and cross-functional team environment, with a focus on open communication, knowledge sharing, and technical mentoring.
- 💡 Problem-solving and decision-making: Questions evaluating the candidate's ability to analyze complex technical challenges, make informed decisions, and drive technical solutions that meet the organization's goals and objectives.
📝 Enhancement Note: Sabre Systems, LLC values a well-prepared and confident candidate who can articulate their technical expertise, cultural fit, and career goals clearly and effectively. The company's interview process is designed to assess the candidate's technical skills, problem-solving abilities, and cultural fit, with a focus on finding the best match for the organization's needs and the candidate's career aspirations.
📌 Application Steps
To apply for this HPC Engineer/Data Storage Engineer position:
- 📝 Review the job description thoroughly, ensuring you understand the role's requirements, responsibilities, and qualifications.
- 📝 Tailor your resume to highlight your relevant experience, skills, and achievements in HPC data storage management, Linux system administration, and scripting.
- 📝 Prepare a comprehensive portfolio demonstrating your experience with HPC data storage management, including case studies, metrics, and performance reports, as well as examples of your scripting skills and expertise with the Lustre filesystem and Linux system administration.
- 📝 Research Sabre Systems, LLC and the specific team you'll be working with, ensuring you understand the company's mission, values, and culture, as well as the team's dynamics and collaboration style.
- 📝 Practice technical interview questions and coding challenges related to HPC data storage management, Linux system administration, and scripting, focusing on your ability to diagnose and resolve storage system issues, as well as your knowledge of capacity planning, performance management, and hardware management.
- 📝 Prepare for behavioral and cultural fit interviews, focusing on your ability to collaborate effectively with other technical teams, share knowledge and expertise, and contribute to a positive and inclusive work environment.
- 📝 Submit your application through the Sabre Systems, LLC careers website, following the instructions provided.
⚠️ Important Notice: Sabre Systems, LLC uses an Applicant Tracking System (ATS) to manage job applications and candidate screening. Including relevant keywords in your resume and portfolio can help ensure that your application is properly matched with the HPC Engineer/Data Storage Engineer role and increase your chances of being selected for an interview.
Application Requirements
Candidates must have at least 3 years of related experience and knowledge of the Lustre filesystem. Practical experience in scripting and broad knowledge of data storage administration are also required.