Role: Cloud Infra SME - Terraform & GKE
Location: Plano, TX
Role Summary:
The Cloud Administrator SME (Onshore) is a critical, customer-facing technical expert responsible for the 24x7 operational health, effective L2 incident management, and onsite support of our hybrid, multi-cloud infrastructure supporting mission-critical healthcare systems, such as EPIC Electronic Health Record (EHR) workloads, across Azure, GCP, and OCI. This role requires hands-on expertise in IaC and container orchestration (GKE), and ensuring absolute compliance with HIPAA and other healthcare regulations, with a strong focus on high-touch coordination during major hospital events.
Key Responsibilities:
Serve as the primary L2/L3 Onshore technical expert for all cloud infrastructure and EPIC workload incidents, ensuring rapid diagnosis and resolution to minimize impact on clinical operations.
Lead communications during active incidents, providing clear, concise, and professional updates to hospital IT teams, clinical stakeholders, and internal leadership.
Perform onsite, hands-on support during critical periods, including EPIC go-lives, major version upgrades, patching windows, and disaster recovery drills.
Coordinate directly with EPIC technical staff, hospital IT teams (network, security, application), and vendors to resolve complex cross-functional dependencies.
Perform daily administration, monitoring, and proactive remediation of IaaS/PaaS services across Azure, GCP, and Oracle Cloud Infrastructure (OCI) supporting EPIC environments.
Operational expertise with Google Kubernetes Engine (GKE): Manage the deployment, scaling, health, and operational troubleshooting of containerized EPIC components or supporting microservices.
Perform resource provisioning, capacity monitoring, and infrastructure tagging for cost management and chargebacks across all multi-cloud tenants.
Maintain and validate IAM, backup, and geo-redundant Disaster Recovery (DR)/Business Continuity Planning (BCP) mechanisms for EPIC and integrated hospital systems.
Implement and enforce security baselines and compliance controls (e.g., HIPAA, SOC2, CIS, NIST) at the infrastructure layer across Azure, GCP, and OCI.
Drive automation efforts for routine operational tasks and provisioning using Terraform for IaC and scripting tools (PowerShell, Python) to ensure operational consistency and auditability.
Manage log monitoring, correlation, and initial incident response for security and performance events within the EPIC cloud environments.
Tools: Terraform, GitHub, Ansible, Tanium, PowerShell, YAML, Bash, Python.
Qualifications
Technical Skills:
Experience & Environment: 7-10 years in infrastructure operations, with at least 5 years dedicated to Cloud Administration and 5+ years supporting EPIC EHR or similar mission-critical clinical systems in a hybrid environment.
Multi-Cloud Hands-on Expertise: Proven expertise in the administration, configuration, and operational support of Microsoft Azure, Google Cloud Platform (GCP), and Oracle Cloud Infrastructure (OCI).
Containerization: Strong practical experience in the operational management, troubleshooting, and administration of Google Kubernetes Engine (GKE) clusters.
EPIC Systems Knowledge: Solid understanding of EPIC infrastructure requirements, deployment topologies (e.g., Clarity, Chronicles), and operational dependencies.
Automation & Scripting: Highly proficient with:
IaC: Terraform for multi-cloud deployments.
Scripting: PowerShell, Python, YAML, and Bash.
Compliance: Deep knowledge of HIPAA requirements and operational procedures necessary to maintain compliance in a cloud environment.
Communication & Coordination: Exceptional onsite communication, stakeholder management, and incident command skills required to effectively coordinate during critical hospital operations.
"*** is an Equal Employment Opportunity employer. We promote and support a diverse workforce at all levels of the company. All qualified applicants will receive consideration for employment without regard to race, religion, color, sex, age, national origin or disability. All applicants will be evaluated solely on the basis of their ability, competence, and performance of the essential functions of their positions with or without reasonable accommodations. Reasonable accommodations also are available in the hiring process for applicants with disabilities. Candidates can request a reasonable accommodation by contacting the company ADA Coordinator at
."