Hawk Sense Business Solutions Pvt Ltd (formerly known as Hawk Sense Security Systems Pvt Ltd)
Job Description
Position: DevOps / SRE Engineer
Client: Acuver (Serving Notice Period candidates preferred)
Location: Bangalore to Bangalore
Experience: 5-8 Years
Employment Type: Full Time
Work Location: Bangalore to Bangalore preferred

Budget Details (Based on Experience)
• 5 Years: Up to 18 LPA
• 6 Years: Up to 20 LPA
• 7 Years: Up to 24 LPA
• 8 Years: Up to 26 LPA

Role Overview
We are looking for a mid-senior DevOps / SRE Engineer to build and scale a cloud-native, event-driven platform supporting high-throughput logistics and fulfillment systems. The role involves building infrastructure, CI/CD pipelines, and observability, and ensuring system reliability and scalability.

Required Skills
• 5-8 years in DevOps / SRE
• Strong AWS (preferred) or GCP
• Hands-on experience in: Kubernetes, Docker, Terraform, CI/CD tools (GitLab, Jenkins)
• Experience with: Kafka / event-driven systems, Prometheus / Grafana / ELK, microservices architecture
• Strong knowledge of: networking, load balancing, high availability systems

Good to Have
• Service Mesh (Istio / Linkerd)
• PostgreSQL / AWS RDS
• Snowflake exposure
• Logistics / Supply Chain domain
• DevSecOps practices

Key Responsibilities

CI/CD & Release Engineering
• Design and implement CI/CD pipelines (GitLab CI, Jenkins)
• Automate build, test, and deployment workflows
• Implement blue-green / canary deployments
• Ensure rollback mechanisms and deployment governance

Cloud Infrastructure & IaC
• Manage infrastructure on AWS (primary) / GCP
• Use Terraform for Infrastructure as Code
• Build reusable modules
• Optimize cost, performance, and scalability

Containerization & Orchestration
• Deploy applications using Docker & Kubernetes
• Manage workloads using Helm
• Implement auto-scaling and high availability

Reliability Engineering (SRE)
• Define SLIs, SLOs, SLAs
• Drive uptime and performance improvements
• Handle incident response and RCA
• Build fault-tolerant systems

Observability & Monitoring
• Metrics: Prometheus
• Logs: ELK / Kibana
• Tracing: OpenTelemetry / Jaeger
• Build actionable alerting systems

Event-Driven Systems
• Work with Kafka, Pub/Sub, SQS/SNS
• Support distributed microservices architecture
• Improve throughput and resilience

Database Support
• Work with PostgreSQL (AWS RDS)
• Performance tuning
• Backup and recovery
• HA and failover setup