Currently we're investigating a bug where the answers are not recorded properly. If you encounter this bug, please email us at rfqjobs@gmail.com.
DevOps Engineer, High Performance Computing
(more about Tower Research)
This job posting has been archived.
Unfortunately Tower Research has archived this job posting, they are no longer accepting applications.

DevOps Engineer, High Performance Computing

Tower Research Capital LLC, a high-frequency proprietary trading firm founded in 1998, seeks a skilled DevOps Engineer to join our Server Reliability Engineering division on the High Performance Computing team.

The SRE division manages engineering, deployment and operation of the Trading infrastructure, with a focus on performance optimization, monitoring, automation and capacity. The team ensures that the optimal levels of systems and infrastructure are available, and preemptively resolves the issues. 

The High Performance Computing team is responsible for developing and supporting the large scale compute and storage platform designed to solve demanding financial and business problems through computer modeling, simulation and analysis.

Responsibilities

  • Developing and enhancing Tower’s HPC infrastructure stack - compute, storage, networking, automation and monitoring
  • Guiding platform users in designing, building, testing and deploying changes to existing software for the HPC on-premises and cloud-based environments
  • Developing and operating company’s containerized applications environment (Docker, bare-metal Kubernetes and GKE)
  • Developing and operating the company's cloud infrastructure (GCP, AWS)
  • Developing, maintaining and improving HPC workload management software (HTCondor)
  • Developing, maintaining and improving HPC custom management tools, services, and SDKs
  • Developing metric collection capabilities, analyzing results and using them to improve HPC clusters resource utilization and performance
  • Managing code deployments, fixes, updates and related processes.
  • Updating system processes and designing new processes as needed. Identify manual processes that can be automated and help with their automation

Qualifications

  • 5-7 years of experience in software engineering, development operations, or Linux administration, with at least 1 year of Kubernetes DevOps experience
  • Strong programming experience, with a preference for Go and Python
  • Experience in building RESTful applications
  • Experience in writing SQL queries
  • Understanding of containerization, container networking, and Kubernetes
  • Experience in continuous integration/continuous deployment and infrastructure-as-code
  • Experience with at least one cloud platform (GCP, AWS or Azure)
  • Working knowledge of systems or network automation, monitoring, and alerting
  • Working knowledge of some configuration management system (such as Salt or Ansible)
  • Demonstrable troubleshooting expertise and interest, a desire to automate and focus on end-user experience

Benefits

Tower’s Canada office is located in Downtown Montreal. While we work hard, Tower’s cubicle-free workplace, jeans-clad workforce, and well-stocked kitchens reflect the premium the firm places on quality of life. Benefits include:

  • Competitive salary and performance-based bonuses
  • 5 weeks of paid vacation per year
  • Lunch and snacks on a daily basis
  • Reimbursement for health and wellness expenses
  • Free events and workshops
  • Donation matching program

Tower Research Capital is an equal opportunity employer.