The Cloud Systems Engineer will manage, maintain, and support SOLV Energy’s Azure and AWS cloud-based infrastructure to ensure consistent, reliable, and secure operations, while maximizing the value of subscribed systems and services.
Requirements
- Assess current Azure and AWS instance and create continuous improvement roadmap.
- Define cloud systems strategy to maximize the return on IT investments, while meeting or exceeding uptime and performance expectations.
- Drive the adoption of best practices in cloud architecture and operations, ensuring high standards of performance and security.
- Design, plan, and implement Azure and AWS based systems and services in support of functional, storage, compute, data integration, and systems security initiatives.
- Collaborate with SecOps and IT Operations teams to ensure that new or updated solutions and services comply with the enterprise cyber security standards.
- Proactively identify and apply system updates to prevent issues, strengthen security, tune performance, automate tasks, and manage costs.
- Monitor Azure and AWS systems operations and address alerts, anomalies, and issues.
- Develop and support Disaster Recovery, Backup, and retention policies on Azure and AWS platforms.
- Maintain zero trust endpoint security with tools such as Microsoft Defender
- Develop and implement training programs for team members to enhance their skills and knowledge in Azure technologies
- Lead project planning and execution, ensuring timely delivery of cloud solutions and adherence to project timelines
- Manage cloud networking and security and monitor the logging of systems.
- Development and maintenance of IT policies and procedures.
- Ensure compliance with IT General Controls, provide needed information in support of audits and to substantiate process and controls compliance.
- Own and maintain enterprise monitoring and alerting platforms, including Zabbix and cloud‐native tools, to provide clear visibility into the health, performance, capacity, and availability of Azure and AWS environments.
- Build and support automation workflows using scripting and orchestration tools such as Ansible/AWX and operational runbooks to reduce manual effort, improve reliability, and streamline day‐to‐day operations.
- Identify and adopt practical AI‐assisted features within monitoring and automation tools to improve anomaly detection, alert quality, and operational insights, while ensuring decisions and remediation remain under engineering control.
Benefits
- medical, dental, vision, basic life and disability insurance
- 401(k) plan
- vacation, sick and holiday pay