Sr Software Engineer- Devops

Apply now »

Date: Sep 7, 2023

Location: Pune, MH, IN

Company: Houghton Mifflin Harcourt

Job Requisition ID: 18676

Additional Locations:  


“Houghton Mifflin Harcourt is a high-purpose, high-performing, leading K-12 EdTech company, powered by values and culture that support continuous learning and rapid innovation. Known for serving more than 50 million students and 4 million educators in 150+ countries, we provide learning to countless students and teachers, transforming lives, supporting communities, and making our society more open, just, and inclusive for all. HMH Technology India Pvt. Ltd. is our technology and innovation arm in India, focused on developing products and solutions using cutting-edge technology in order to better serve education globally. We aim to help employees grow as people, not just as professionals. We do work that matters and are in a unique position to support and empower people’s lives.”




Senior DevOps Engineer

HMH is a learning technology company committed to delivering connected solutions that engage learners, empower educators, and improve student outcomes. As a leading provider of K–12 core curriculum, supplemental and intervention solutions and professional learning services, HMH partners with educators and school districts to uncover solutions that unlock students' potential and extend teachers' capabilities. HMH serves more than 50 million students and 4 million educators in 150 countries.  

Technical Infrastructure: 

  • Here’s just some of what we use: 
  • AWS EC2, Terraform Enterprise, Docker, Aurora, Mesos, Kubernetes, ELK (Elastic Search, Logstash & Kibana)
  • Grafana, Prometheus, Datadog, Telegraf, Runscope, Apollo, GraphQL
  • Microservices Architecture, Spring, Java & NodeJS, React, Koa, Express.js
  • Amazon RDS, Dynamo DB, Postgres, Oracle, MySQL, Influx DB, Linux, Jenkins. GitHub

About the role:

You will constantly be asking; what are the most important infrastructure problems we need to solve for today that will increase our applications and infrastructures reliability and performance.

You will apply your deep technical knowledge, taking a broad look at our technology infrastructure. You’ll help us identify common and systematic issues and validate these, prioritizing which to strategically address first. 

We value collaboration. So, you will partner with our SRE/DevOps team, discussing and refining your ideas and preparing proof of concepts. You will present and validate these across technology teams, figuring out the best solution and you’ll be given ownership to engineer and implement your solutions. 

There’s lot of interesting technology problems for you to solve, so you are constantly applying latest thinking. These include, implementing Canary, designing a new automated pipeline solution, extension of Kubernetes capabilities, implementation of machine learning to build load testing, ensuring mutability of containerization etc. 

You will get to evaluate existing technologies and design the future state without being afraid to challenge the status quo. And you’ll regularly review existing infrastructure, looking for opportunities to improve (E.g. service improvements, cost reduction, security, performance). You’ll also get to automate everything necessary, combining reliability with a pragmatic approach, doing it right, first time.  

We’re continuing our journey of making our code and configuration deployments self-serve for our development teams.

  • You’ll help us build and maintain the right tooling and you’ll have ownership to design and implement the infrastructure needed
  • You’ll also be involved in the daily management of our AWS infrastructure. This means working with our Agile development teams, to troubleshoot server, application, and performance issues



Skills & Experience:

  • 3 to 6 years hands-on SRE/DevOps experience in an Agile environment
  • You’ll be able to collaborate effectively with both engineers and operations, and be comfortable recommending best practices
  • Substantial experience using AWS in a production environment
  • You have the expertise and skills to navigate the AWS ecosystem, and will know when and where to recommend the most appropriate service, and/or usage pattern
  • You have experienced resolving outages, and are able to quickly diagnose issues and been instrumental in restoring normal service levels
  • You have an intellectual curiosity, and an appetite to learn more



You’ll also have significant experience and/or an interest in the following:

  • Managing cloud infrastructure as code
  • Application Container Management 
  • Expertise with an RDBMS
  • Experience in how to tune, scale and how performance and reliability are achieved
  • Experience working with Linux
  • Experience with management of Messaging Queues and event driven systems
  • Experience working with firewalls, network and application load balancing & secret management
  • Experience working with CI/CD tools
  • Experience using scripting languages
  • A strong and informed point of view with respect to monitoring tools and how best to use them




Job Segment: Curriculum, Education, Publishing