Position Summary
We are seeking a motivated and detail-oriented Azure DevOps Engineer to join our remote infrastructure and operations team. The ideal candidate will be responsible for developing and managing monitoring and automation solutions to ensure high availability, reliability, and performance of Azure-based services. This role also involves monitoring customer-reported infrastructure issues, troubleshooting incidents, and collaborating with engineering teams to maintain operational excellence.
Key Responsibilities
- Develop, implement, and maintain monitoring, alerting, and automation solutions for Azure cloud infrastructure.
- Ensure high availability, reliability, and performance of Azure services through proactive monitoring and continuous improvement initiatives.
- Monitor customer support tickets related to infrastructure, cloud platform, networking, and system availability issues.
- Investigate, troubleshoot, and resolve infrastructure-related incidents within defined service-level objectives (SLOs).
- Perform root cause analysis for recurring issues and implement preventive measures.
- Automate operational tasks using scripting and Infrastructure-as-Code (IaC) tools.
- Configure and maintain Azure monitoring tools such as Azure Monitor, Log Analytics, Application Insights, and related services.
- Collaborate with software engineering, support, and security teams to deploy and maintain cloud infrastructure.
- Manage incident response, escalation, and post-incident documentation.
- Support CI/CD pipeline operations and deployment processes.
- Create and maintain operational runbooks, monitoring documentation, and troubleshooting guides.
- Participate in on-call support rotations as required.
Key Skills
- Microsoft Azure
- Azure Monitoring & Alerting
- Infrastructure Automation
- Incident Management
- Troubleshooting & Root Cause Analysis
- PowerShell / Python / Bash
- CI/CD Pipelines
- Azure DevOps
- Infrastructure as Code
- Cloud Operations
- Customer Support & Ticket Management
What Success Looks Like
- High availability and reliability of Azure-hosted services.
- Timely resolution of customer-reported infrastructure issues.
- Reduced manual operational effort through automation.
- Improved monitoring coverage, alert accuracy, and incident response times.
- Strong collaboration with engineering and support teams to maintain a stable cloud environment.
Qualifications
Required
- 2+ years of experience in any Software Engineering, Systems Engineering, Cloud Engineering, DevOps, Site Reliability Engineering, or related technical role.
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent practical experience.
- Minimum 2 years of experience in a software engineering or related technical role.
- Basic understanding of Microsoft Azure services and cloud infrastructure concepts.
- Experience with scripting languages such as PowerShell, Python, Bash, or similar.
- Knowledge of monitoring and observability concepts, including metrics, logs, dashboards, and alerting.
- Familiarity with infrastructure troubleshooting, networking fundamentals, and operating systems.
- Understanding of CI/CD concepts and DevOps practices.
- Strong analytical and problem-solving skills.
- Excellent communication and collaboration abilities in a remote work environment.
Preferred
- Master’s degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent practical experience.
- Hands-on experience with Azure Monitor, Log Analytics, Application Insights, or similar monitoring platforms.
- Experience with Infrastructure as Code tools such as Terraform, Bicep, or ARM Templates.
- Knowledge of Azure DevOps, GitHub Actions, or other CI/CD platforms.
- Familiarity with container technologies such as Docker and Kubernetes.
- Azure certifications such as Azure Fundamentals (AZ-900), Azure Administrator (AZ-104), or Azure DevOps Engineer Expert (AZ-400).
Benefits
- Remote-first work environment within the United States
- A collaborative, engineering-focused culture with strong mentorship and learning opportunities
- Hands-on exposure to Azure, SaaS operations, automation, and reliability engineering
- A clear path to grow into cloud engineering, DevOps, or SRE roles over time
In addition to base salary, this role is eligible for the following:
- Bonus: Eligibility for an annual performance bonus
- Health & Wellness: Medical, dental, and vision insurance;
- Retirement: 401(k) with company match
- Time Off: Flexible PTO, paid holidays
Specific benefit eligibility and details will be confirmed during the recruiting process. Benefits may vary for part-time, contract, or temporary roles.
Ready to Apply?
To apply for this position, complete the application below, or use this link. We review every application and will be in touch if your background is a strong match.