Principal Architect - Cloud and Observability

CVS Health

IT
Remote
USD 144,200-288,400 / year + Equity
Posted on Apr 4, 2026

We’re building a world of health around every individual, shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold themselves accountable and prioritize safety and quality in everything they do. Join us and be part of something bigger: helping to simplify health care one person, one family and one community at a time.

Position Summary

We're hiring a Principal Architect to take ownership of how we do observability and hybrid cloud at CVS Health. This person will sit within our Enterprise Architecture organization and be responsible for the architecture, standards, and technical direction behind our observability platforms and our multi-cloud infrastructure posture.

We run workloads across on-prem private cloud (OpenShift, KVM, Dell PowerFlex), Azure, AWS, and GCP. We need someone who can build and maintain the reference architectures, telemetry standards, and instrumentation patterns that let our engineering teams monitor all of that consistently. We've committed to an OpenTelemetry-first approach and use the Grafana stack (Mimir, Loki, Tempo) as our primary backends, but we also operate Datadog, Splunk, and Dynatrace in various parts of the org.

On the cloud side, there is real work to do around workload identity, runtime selection, autoscaling guidance, and FinOps. Teams are asking for concrete standards they can follow.

This is a hands-on role. You'll write architecture docs, build proof-of-concepts, configure OTel pipelines, and present to leadership.

This position can be performed remotely from anywhere in the continental United States.

Responsibilities

Observability

  • Own the enterprise observability reference architecture covering metrics, logs, traces, and events across all environments (cloud and on-prem).
  • Drive the OpenTelemetry-first instrumentation strategy -- standard libraries, semantic conventions, collector topologies (DaemonSet, gateway, sidecar), and pipeline design.
  • Build and operate telemetry pipelines on Grafana Mimir, Loki, and Tempo, including multi-tenant configurations, retention policies, and capacity planning.
  • Define how we measure reliability: SLOs, SLIs, error budgets, and alerting frameworks -- consistently across all lines of business.
  • Own the integration between observability tooling and incident management (ServiceNow ITOM, xMatters).
  • Drive telemetry schema standards to ensure teams emit data that is useful downstream, not just technically compliant.

Hybrid Multi-Cloud

  • Build and maintain reference architectures for our hybrid footprint: OpenShift on-prem with KVM/libvirt and Dell PowerFlex storage, plus Azure, AWS, and GCP.
  • Lead standards work around workload identity and federation using SPIFFE/SPIRE and cloud-native IAM patterns to move away from static secrets.
  • Provide guidance on compute runtime selection -- containers vs. VMs vs. bare metal vs. serverless -- with a clear decision framework for teams.
  • Help teams connect autoscaling and capacity planning behavior to actual telemetry signals.
  • Push FinOps maturity forward by integrating cost data into the observability stack, establishing unit economics, and working toward open billing standards like FOCUS.

AI + Observability

  • Identify where AI/ML adds practical value in our observability stack -- anomaly detection, root cause analysis, log clustering, and smarter alerting.
  • Define observability standards for AI-powered systems (agents, RAG pipelines) -- covering latency, token costs, model drift, and related signals.
  • Ensure new AI-powered platforms are instrumented correctly from day one.

Architecture Community

  • Participate in cross-functional architecture working groups focused on observability and hybrid cloud standards.
  • Publish architecture decision records and reference implementations that teams can actually use.
  • Mentor architects and platform engineers; conduct architecture reviews to raise the bar across the org.
  • Work with security and compliance on HIPAA, SOX, and PCI requirements as they apply to telemetry and cloud infrastructure.
  • Represent CVS Health in vendor evaluations and stay connected to the open-source ecosystem (CNCF, OpenTelemetry, Grafana Labs).

Required Qualifications

  • 10+ years in infrastructure, cloud architecture, platform engineering, or SRE
  • 8+ years of architecture work in observability, cloud infrastructure, or both at a large enterprise
  • Solid experience with at least two of Azure, AWS, or GCP -- including networking, identity, compute, and storage
  • 5+ years with Kubernetes in production (OpenShift, EKS, AKS, or GKE)
  • 5+ years with OpenTelemetry or similar frameworks (collectors, SDKs, semantic conventions, pipeline design)
  • 5+ years with observability platforms: Grafana/Mimir/Loki/Tempo, Prometheus, Datadog, Splunk, Dynatrace, or comparable tools
  • Experience defining SLOs/SLIs and building alerting strategies at an organizational level
  • Proven track record writing architecture standards that other teams adopted and followed
  • Able to communicate clearly with both engineers and senior leadership

Preferred Qualifications

  • On-prem / private cloud experience (OpenShift Virtualization, KVM/libvirt, VMware, Dell PowerFlex or similar storage)
  • Workload identity (SPIFFE/SPIRE) and zero-trust networking
  • Infrastructure-as-code (Terraform, Pulumi, Helm, ArgoCD)
  • Streaming platforms such as Kafka or Confluent, especially in telemetry pipeline contexts
  • AIOps or ML-based anomaly detection experience
  • FinOps background -- cloud cost optimization, chargeback, unit economics
  • Service mesh (Istio, Envoy, Linkerd) or eBPF-based tools (Cilium, Pixie)
  • Involvement in open-source communities (CNCF, OpenTelemetry, etc.)
  • Healthcare, insurance, or financial services experience (HIPAA/SOX familiarity)
  • Cloud certifications are a plus but not required

Education

Bachelor's degree in Computer Science, Engineering, or a related field. Equivalent work experience accepted.

Pay Range

The typical pay range for this role is:

$144,200.00 - $288,400.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.

Great benefits for great people

We take pride in our comprehensive and competitive mix of pay and benefits – investing in the physical, emotional and financial wellness of our colleagues and their families to help them be the healthiest they can be. In addition to our competitive wages, our great benefits include:

  • Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan.

  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching.

  • Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility.

For more information, visit https://jobs.cvshealth.com/us/en/benefits

We anticipate the application window for this opening will close on: 06/29/2026

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.