About the role As a Site Reliability Engineer (SRE), you will make an impact by designing and implementing advanced observability solutions for edge computing environments. You will be a valued member of our Infrastructure & Operations team, collaborating with engineering and platform teams to ensure high availability, reliability, and performance across distributed systems. In this role, you will: Design and implement observability frameworks for edge environments, including monitoring, logging, tracing, and metrics collection. Define and maintain SLIs, SLOs, and business KPIs to improve system reliability across edge and centralized infrastructure. Build and optimize dashboards, visualizations, and alerting systems for real-time insights and rapid incident response. Implement distributed tracing and log aggregation systems to troubleshoot complex issues in edge computing. Collaborate with engineering teams to embed observability best practices into applications and infrastructure. Drive proactive issue detection and resolution, reducing MTTD and MTTR across distributed systems. Lead incident postmortems and implement observability-driven improvements to prevent recurrence. Develop automation scripts and tools to enhance observability pipelines, addressing edge-specific challenges like bandwidth and connectivity. What you need to have to be considered 3–5 years of experience in service reliability/operations for large-scale, high-performance applications in hybrid environments (on-prem and cloud). Strong scripting and automation skills for building dashboards and managing application performance. Proficiency in programming languages such as Go, Python, Java, or Rust. Hands‑on experience with databases (Oracle, SQL Server, Redis, Clickhouse, Postgres, MongoDB, or time‑series DBs). 2+ years of experience transitioning platforms to cloud and containerization (GCP, AWS, Rancher, or similar). Experience maintaining containerized applications in GKE/RKE/AKE environments. Expertise in implementing cloud observability using OpenTelemetry (OTEL) for monitoring and distributed tracing. Knowledge of networking protocols (TCP/IP, DNS) and troubleshooting in high‑pressure scenarios. These will help you stand out Experience managing application availability for 24x7 high-availability platforms. Familiarity with monitoring tools like Splunk, AppDynamics, Grafana/Prometheus, and Dynatrace. Hands‑on experience with CI/CD tools and Rally, Confluence. Knowledge of in‑memory caching solutions (Redis preferred). Strong debugging skills across integrated technical platforms and API gateways. Exposure to GCS, Cloud SQL, Spanner, Firestore, and enterprise‑level infrastructure operations. Experience with HashiCorp Vault, Vertex AI, Gen AI, and BigQuery. Work model: On-site This is an onsite position requiring presence at a Cognizant or client location in Arizona City, AZ and/or Scottsdale, AZ. We strive to provide flexibility wherever possible and support a healthy work-life balance through our wellbeing programs. The working arrangements for this role are accurate as of the date of posting. This may change based on the project you’re engaged in, as well as business and client requirements. Rest assured; we will always be clear about role expectations. Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government issued ID during each interview. Salary and Other Compensation The annual salary for this position is between $60,000 – $93,500 depending on experience and other qualifications of the successful candidate. This position is also eligible for Cognizant’s discretionary annual incentive program, based on performance and subject to the terms of Cognizant’s applicable plans. Benefits Medical/Dental/Vision/Life Insurance Paid holidays plus Paid Time Off 401(k) plan and contributions Long-term/Short-term Disability Paid Parental Leave Employee Stock Purchase Plan Disclaimer The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law. #J-18808-Ljbffr Cognizant
...Urban Landworks is seeking an experienced Skid Loader Operator to join our winter operations team. This position is responsible for snow removal... ...Overview The Skid Loader Operator will operate a skid steer to plow, stack, and relocate snow at commercial sites...
...Employee Benefits: * Fuel Your Growth with Love's - company funded tuition assistance... ...starting at 40 hours worked per week ~ Drivers are eligible for monthly and annual bonuses... ...safest tank carriers. Drives company trucks to load and deliver fuel and commodities...
...and will be a key asset in our policy campaign efforts in Washington and beyond. Anthropic is equal parts research lab, policy think-tank, and technology startup. We care deeply about safe development of AI systems and build partnerships with governments through...
...divh2Laundry Assistant/h2pPosition: Laundry Assistant/ppWork Schedule: Rotating weekends - Morning shifts/ppJob Type: Full Time/ppWho We Are:/ppAre you looking to have fun while making a difference in the lives of others? Do you want a job that can turn into a career?...
...seeking a travel nurse RN Med Surg for a travel nursing job in Delta, Colorado. Job Description & Requirements ~ Specialty:... ...become the preferred staffing provider by delivering top-notch customer service. Join us and experience the difference firsthand! Benefits:...