Software Engineer - DevOps


Date: Sep 16, 2023

Location: Pune, MH, IN

Company: Houghton Mifflin Harcourt

Job Requisition ID: 18307



“Houghton Mifflin Harcourt is a high-purpose, high-performing, leading K-12 EdTech company, powered by values and culture that support continuous learning and rapid innovation. Known for serving more than 50 million students and 4 million educators in 150+ countries, we provide learning to countless students and teachers, transforming lives, supporting communities, and making our society more open, just, and inclusive for all. HMH Technology India Pvt. Ltd. is our technology and innovation arm in India, focused on developing products and solutions using cutting-edge technology in order to better serve education globally. We aim to help employees grow as people, not just as professionals. We do work that matters and are in a unique position to support and empower people’s lives.”




DevOps Engineer
HMH is a learning technology company committed to delivering connected solutions that
engage learners, empower educators, and improve student outcomes. As a leading provider
of K–12 core curriculum, supplemental and intervention solutions and professional learning
services, HMH partners with educators and school districts to uncover solutions that unlock
students' potential and extend teachers' capabilities. HMH serves more than 50 million
students and 4 million educators in 150 countries.

Technical Infrastructure:
Here’s just some of what we use:
• AWS EC2, Terraform Enterprise, Docker, Aurora, Mesos, Kubernetes, ELK (Elasticsearch,
Logstash & Kibana)
• Grafana, Prometheus, Datadog, Telegraf, Runscope, Apollo, GraphQL
• Microservices Architecture, Spring, Java & Node.js, React, Koa, Express.js
• Amazon RDS, DynamoDB, Postgres, Oracle, MySQL, InfluxDB, Linux, Jenkins, GitHub

About the role:
You will constantly be asking: what are the most important infrastructure problems we need
to solve to increase the reliability and performance of our applications and infrastructure?
You will apply your deep technical knowledge, taking a broad look at our technology


You’ll help us identify common and systemic issues, validate them, and
prioritize which to address first. You will partner with our SRE/DevOps team,
discussing and refining your ideas and preparing proofs of concept. You will present and
validate these across technology teams, figuring out the best solution, and you will be given
ownership to engineer and implement your solutions.

There are lots of interesting technology problems for you to solve, so you will constantly be
applying the latest thinking. These include implementing Canary, designing a new automated
pipeline solution, extending Kubernetes capabilities, implementing machine learning
to build load testing, ensuring mutability of containerization, etc.

You will get to evaluate existing technologies and design the future state, without being
afraid to challenge the status quo. And you will regularly review existing infrastructure,
looking for opportunities to improve (e.g. service improvement, cost reduction, security,
performance). You’ll also get to automate everything necessary, combining reliability with a
pragmatic approach.

We’re continuing our journey of making our code and configuration deployments self-serve
for our development teams.
• You’ll help us build and maintain the right tooling
• And you’ll have ownership to design and implement the infrastructure needed
• You’ll also be involved in the daily management of our AWS infrastructure
• This means working with our Agile development teams to troubleshoot server
application and performance issues


Skills and Experience:

• 3.5 to 4.5 years hands-on SRE/DevOps experience in an Agile environment
• Collaborate effectively with both engineers and operations, and be comfortable
recommending best practices
• Experience using AWS in a production environment
• Expertise and skills to navigate the AWS ecosystem and the ability to recommend the
most appropriate service and/or usage pattern
• Experience resolving outages, with the ability to quickly diagnose issues and a record of
being instrumental in restoring normal service levels

Experience and/or an interest in the following:
• Managing cloud infrastructure as code
• Application Container Management
• Expertise with an RDBMS
• Experience working with Linux
• Experience managing message queues and event-driven systems
• A security-conscious approach, with experience working with firewalls, network and
application load balancing, and secret management
• Experience working with CI/CD tools
• Experience using scripting languages
• A strong and informed point of view with respect to monitoring tools and how best to
use them

Job Segment: Curriculum, Education, Publishing