Overview
Cloud Operations Engineer – Nashville, 37201, United States of America
How we LEAD:
We are currently seeking an eager and collaborative individual to serve as a technical resource on the CloudOps team. This person will provide technical support and guidance for cloud-based SAP solutions and will be a key technical support resource for the greater cloud landscape.
How you’ll CREATE:
The responsibilities of this position include, but are not limited to, the following functions in the Operations organization:
-
Utilize technical skills and tools to coordinate, maintain, enhance, and deploy cloud-based solutions while following corporate standards and best practices
-
Work within a CloudOps team to meet project objectives in a timely fashion
-
Update project status reports (progress/risks/issues/roadblocks) as required by Senior Management
-
Use CloudOps methodology and toolsets to implement and support solutions in AWS, Azure, and GCP
-
Implement proposed solutions to ensure the delivery of a quality product or service
-
Ensure cloud resources meet UMG’s operational requirements and are in compliance with UMG infrastructure and security standards
-
Provide documentation and communication to peers and other IT Teams for status, coordination, objectives, and performance
-
Compile and remediate vulnerabilities identified by the security team
-
Assess and troubleshoot issues, consult with vendors, and coordinate with other teams to resolve problems
-
Ensure high work standards regarding incident and change management tasks
-
Other tasks as deemed necessary or appropriate
Skills/Abilities:
Critical and Required Daily Skills
-
SUSE Linux Enterprise Server (SLES) Administration: Deploy, configure, and manage SLES EC2 instances in AWS; troubleshoot boot issues; manage LVM (lvs, vgs, pvs); configure SUID, SGID, and the sticky bit; expand XFS file systems and EBS volumes; manage permissions and ACLs. Also covers user/group management, SSH security, package management (zypper, rpm), process/service management (systemd), log analysis, networking (TCP/IP, DNS, routing, firewalls), backup/restore, patching, and security hardening.
-
AWS Services: EC2, S3, EFS, Storage Gateway, EBS, VPC, VPC Endpoints, IAM, CloudWatch, CloudTrail; provisioning, configuration, monitoring and optimization.
-
Automation & Scripting: Bash/shell scripting, AWS CLI, Infrastructure-as-Code (Terraform/Ansible), cron jobs and system automation.
-
Monitoring & Performance: System resource monitoring, capacity planning, AWS monitoring tools, and troubleshooting performance bottlenecks.
Overall Skillset
-
Proficient usage and management of Linux and Windows technology stacks (Ubuntu, Windows Server, Nginx, Apache, MySQL, PHP, IIS, MS SQL, .NET, and more)
-
Proven experience with AWS, GCP, and Azure in an enterprise setting (personal use of these services will not be considered as equivalent)
-
Experienced usage of Kubernetes technology stacks (K8s, EKS, GKE, Helm, Prometheus, Cortex, Grafana, Istio, and more)
-
Accomplished with modern CloudOps methodologies and toolsets (Chef, Terraform, Jira, Slack, Vault, Python, Cortex)
-
Solid grasp and usage of version control and release management concepts and tools, particularly Git (GitHub, Bitbucket) and SVN, for code branching and merging
-
Understanding of Modern Auth (OIDC, SSO, SAML, Federation)
-
Build and support solutions using both serverless and server-based infrastructure in AWS, Azure, and GCP cloud environments
-
Follow deployment practices using CI/CD processes and technologies (Jenkins, TeamCity, Tekton, Spinnaker, Octopus, CodeDeploy, Automate)
-
Solid understanding of key cloud design concepts such as “High Availability” (HA), “Elastic Load Balancing” (ELB), Principle of Least Privilege, Resiliency, Ephemeral Computing, Stateless Computing, Virtual Networking, and Scaling
-
Detailed knowledge of and demonstrated experience with key AWS-specific technologies including EC2, S3, RDS, Redshift, CloudWatch, CloudFormation, CloudTrail, Storage Gateway, VPC, Transit Gateway, Dedicated Instances, Large Memory U-Type Instances, and Lambda
-
Detailed knowledge of and demonstrated experience with key GCP-specific technologies including Compute Engine, BigQuery, IAM, Bigtable, VPC, Shared VPC, Flow Logs, and Cloud Functions
-
Depth of knowledge of VPC, Cloud Networking, Hybrid Cloud, and IAM constructs in AWS, Azure, and GCP
-
Clear understanding that implementation of security principles and tools is not “optional”
-
Experience with serverless and managed-service technologies (K8s, RDS, Lambda, SNS, SQS, S3, BigQuery, EKS, GKE, Bigtable, Azure AD, Azure AD B2C, API Gateway, CloudFront, Lambda@Edge)
-
Skill configuring and administering different distributions of Linux (Amazon Linux, Ubuntu, Red Hat Enterprise Linux, SLES)
Bring your VIBE:
Experience:
-
Minimum of 5 years of experience delivering IT infrastructure and cloud projects with multiple stakeholders and delivery teams
-
Extensive experience in technical server and enterprise storage management
-
Practical (hands-on) experience with all three leading cloud providers (AWS, Azure, GCP)
-
Practical (hands-on) experience with UMG primary DevOps toolsets (Chef, Terraform Enterprise, Jira, Confluence, Git, Vault, Slack)
-
Practical (hands-on) experience with UMG primary serverless technologies (K8s, GKE, EKS, Lambda, RDS)
-
Practical (hands-on) experience with UMG primary VM technologies (EC2, Dedicated Instances, Compute Engine)
-
Practical (hands-on) experience using scripting languages and command-line tools (Python, PowerShell, Golang, AWS CLI, GCP CLI, Bash)
-
Practical (hands-on) experience using Amazon Linux, Ubuntu, and SLES
-
Practical (hands-on) experience using Windows
-
Practical (hands-on) experience using Kubernetes (K8s) and related toolsets (EKS, GKE, Grafana, Prometheus, Cortex)
-
Other tools: Dynatrace, Azure AD, Intune, SEP, Redlock, Prisma Cloud, SAP, GoAnywhere
Education:
-
BA or BS degree in an IT-related field, or an equivalent combination of formal education and applicable experience, required
-
MCSE, AWS, GCP and UNIX certifications strongly preferred