As a lead engineer, you'll own the fundamental infrastructure components that all other product teams build upon, including our legacy pools, microservices, API gateway, SQL/NoSQL data stores, caching, container orchestration, storage clusters, CI/CD pipelines, and infrastructure playbooks. This is an exciting time to join us as we are rapidly expanding our data center footprint and building out the next-generation cloud infrastructure to support our strategic initiatives.

REQUIREMENTS
• Prior experience building and maintaining large-scale cloud infrastructure
• Hands-on experience with DevOps toolchains, best practices, and methodologies
• 3 to 5+ years of DevOps experience supporting cloud SaaS products, with a total of 10+ years of experience in infrastructure management
• Linux internals, filesystems, disk/storage technologies, and databases (SQL and NoSQL)
• Hardening of Linux OS to the CIS benchmark standards
• Patch management for the Linux OS, databases, middleware, and other software
• Networking skills, e.g., routing, switching, VLAN, Wireshark, VPN
• DB performance and scalability: SQL Server, MySQL, and MongoDB
• CI/CD pipelines using Git, Jenkins, Docker, Ansible, Kubernetes, etc.
• Application benchmarking, capacity planning, and optimization
• Monitoring tools for log analysis, preferably Prometheus and ELK
• Solid programming or scripting skills with one or more languages
• Well versed in SDLC, configurations, branching, releases, and environments
• Well versed in ITSM processes, with experience implementing them within a team
PREFERRED
• Bachelor’s or Master’s in Computer Science, IT, or a related field
• Working experience with JIRA, Opsgenie integration, and incident response
• Security: remediation plans, CVE, OWASP Top 10, IDS, and secure SDLC practices
• Working experience with Juniper firewalls (rule creation, log monitoring, installation, configuration)
• Prior experience managing on-prem data centers, with deep knowledge of and hands-on experience in Linux OS administration and management
• Prior experience working on projects for the US healthcare industry
RESPONSIBILITIES
• Take ownership of our build and deployment pipeline for applications and containers
• Apply your Linux skills to maintain, extend, and scale our hybrid cloud platform
• Identify and replace deficient technologies in our existing infrastructure
• Build internal tools and cloud services to continuously improve developer workflow
• Reduce firing up and provisioning clusters of servers and services to a single button press
• Implement Infrastructure as Code practices to maintain reliable systems
• Architect systems for security, reliability, availability, and scalability
• Perform end-to-end security hardening as per our security policy requirements
• Communicate infrastructure deliveries, rollouts, and details to stakeholders on a weekly basis
• Collaborate with engineering, Ops, and security teams for successful releases
• Manage and prioritize incident response to meet SLA targets and submit reports as needed
• Conduct periodic reviews of our DR processes to ensure their readiness and effectiveness
• Automate and document everything, from MOPs to best practices and scripts