โ† Back to jobs

Site Reliability Engineer (SRE)

Tranzeal, Inc.
CONTRACTOR · Remote · US · San Jose, CA, US · Posted: 2026-05-16 · Until: 2026-07-18
Job Description
As a Site Reliability Engineer, you will collaborate closely with software engineering and infrastructure teams to deploy, scale, and operate cloud-native platforms and services. This role requires strong problem-solving skills, attention to detail, and deep expertise in Kubernetes, automation, and platform reliability.

Key Responsibilities

Design, build, and manage a scalable multi-cluster Kubernetes platform capable of provisioning infrastructure, workloads, and cloud resources on demand, including controllers, CRDs, and automated ingress/DNS/TLS management.

Develop and strengthen platform services and operational tooling, including CI/CD pipelines, SSO integration, RBAC frameworks, secrets management, encryption, and observability solutions.

Integrate AI-powered tooling and automation into platform operations, including the development of agents and support tools that improve SRE efficiency and scalability.

Own and optimize the production release lifecycle, including Helm-based deployments, multi-architecture container builds, staged rollouts, and rollback strategies.

Implement platform-wide monitoring, audit logging, analytics, and operational automation to support reliable service delivery at scale.