Senior Cloud Systems Engineer
Posted on Nov 5, 2020 by Alarm.com
The Cloud Systems Engineer collaborates with the Cloud Architecture and engineering teams in the implementation and operation of Alarm.com's multi-site data center infrastructure. This position works closely with the DevOps, Network Operations, Software Engineering and Quality Engineering teams in meeting and exceeding Alarm.com's internal and external SLAs around maximum availability, security, recover-ability and compliance. This includes monitoring, validating changes, gathering and reporting metrics, testing, incident management, and change management.
The Cloud Systems Engineer's primary job responsibilities will include:
Designing, implementing and automating cloud agnostic solutions for high availability/scalability and performance.
Monitoring, maintaining and updating hardened configurations and baselines
Implementing availability and performance monitoring frameworks
Implementing and testing system level High Availability and Disaster Recovery plans
Tracking, controlling, and reporting status of system conditions, software, documentation, and infrastructure changes to management
Troubleshoot and remediate issues across the infrastructure and create plans to prevent the same problem in the future
Implementing and maintaining configuration and automation frameworks to prevent drift and sustain company growth
Develop and maintain automation scripts to simplify and quicken deployments and modifications.
Providing high quality support to customers, prospects, management and peers
Develop and implementing capacity planning methodologies and operational reporting
BS in Computer Science or related field
7+ years relevant work experience in private/public cloud, SAN and networking
5+ years of experience in database implementations supporting Microsoft SQL Server Experience in data center operations
Working knowledge of hyper-converged infrastructure
Experience supporting multi-site and hot-hot architectures
Working knowledge of Windows Server operating systems is a must
Basic understanding of Linux operating systems is desirable.
Experience with VMware data center stack; vSphere, vSAN, NSX, vRealize and vCenter
Experience of NAS and SAN based storage solutions
Experience implementing and maintaining monitoring frameworks on both commercial off-the-shelf and open source
Experience implementing and updating configuration management frameworks and applications: Ansible, Puppet, Chef, etc.
Relevant certification accreditations on the different components of the Cloud is highly desirable: Network (Cisco, SourceFire, Palo Alto, F5 appliances), Converged infrastructure (Cisco UCS, Flexpod), Virtualization (Vmware) and Storage (NetApp)
Familiar with multi-tiered escalation and on call procedures
Experience working in SOX, FISMA, HIPAA and PCI compliant multi-tenant environments
Ability to work collaboratively within a team environment
Self-directed approach with high degree of initiative to propose new solutions, troubleshoot and resolve issues
On call availability