Senior HPC Engineer (Remote)

Salary: Competitive Salary
Job Type: Full time
Experience: Senior Level

Engtal

Senior HPC Engineer | Engtal | United States

Join a leading proprietary trading firm that leverages advanced technology and innovative trading strategies to excel in global financial markets. The firm is known for its entrepreneurial culture, collaborative environment, and emphasis on cutting-edge technology to gain a competitive edge.

Job Overview:

As a Senior HPC Engineer, you will be responsible for architecting, deploying, and maintaining high-performance computing (HPC) environments that power the firm’s low-latency trading operations. You will collaborate with traders, developers, and infrastructure teams to design highly efficient systems and maximize trading performance. This role is ideal for a hands-on professional who thrives in a fast-paced, technically challenging environment.

Key Responsibilities:

  • Architecture & Design: Develop and implement high-performance computing solutions that cater to the low-latency and high-throughput needs of trading strategies. Evaluate and select appropriate hardware, storage solutions, and networking equipment to ensure optimal performance.
  • Performance Optimization: Analyze and optimize system performance, including compute, network, and storage layers. Fine-tune operating systems, firmware, and applications to achieve ultra-low latency and high availability.
  • Systems Engineering: Design, build, and maintain scalable and robust HPC clusters, including server configurations, networking, and storage architectures. Manage and configure GPU/FPGA-based systems where applicable.
  • Monitoring & Automation: Develop monitoring and alerting systems to proactively detect issues, maintain system health, and ensure high availability. Automate recurring tasks and configuration management using tools like Ansible, Puppet, or Chef.
  • Collaboration & Support: Work closely with traders, quantitative researchers, and software engineers to understand computational needs and deliver tailored HPC solutions. Provide support for system issues and participate in on-call rotations as needed.
  • Scripting & Development: Create and maintain scripts to manage system configurations, deployments, and backups. Develop automation tools and services to enhance the reliability and efficiency of HPC operations.
  • Security & Compliance: Ensure all systems are compliant with security policies, including access controls and patch management. Stay updated with the latest developments in cybersecurity as they relate to HPC environments.

Requirements:

  • Bachelor’s, Master’s, or PhD degree in Computer Science, Electrical Engineering, or a related field
  • 5+ years of experience in HPC architecture, system design, or a similar role within a large-scale compute environment
  • HPC system design and optimization
  • Parallel computing
  • Linux systems administration and enterprise storage solutions (e.g., Vast, DDN, Isilon)
  • HPC management tools (e.g., Kubernetes, Docker, Slurm)
  • High-performance processors and compute offload devices (e.g., GPUs, FPGAs)
  • Low-latency network architecture, including high-speed interconnects (e.g., InfiniBand, Ethernet)
  • Datacenter design and optimization
  • AI/ML frameworks and their integration into HPC systems
  • Programming languages such as Python, Bash, C++, or similar
  • Exceptional communication skills, with the ability to translate complex technical concepts for non-technical audiences
  • Proven ability to influence decision-making and align global teams to achieve a unified vision

Preferred Experience:

  • Experience in a financial trading environment, particularly with low-latency trading systems.
  • Familiarity with GPU/FPGA computing for accelerated workloads.
  • Knowledge of containerization technologies such as Docker and orchestration with Kubernetes.
  • Understanding of cybersecurity principles and best practices in HPC environments.
