Primary Skills: Automation, Containerization, CI/CD,
Kubernetes, Cloud (Azure, AWS, GCP), Software Development Duration:
Direct Hire Location: San Francisco, CA (remote until restrictions
are lifted) Category: Networking, Systems, Security & DevOps
Senior DevOps Engineer SaaS Operations
About Us An application performance monitoring solution that
uses machine learning and artificial intelligence (AI) to provide
real-time visibility and insight into IT environments. With our
unique AIOps solution, you can take the right action at exactly the
right time with automated anomaly detection, rapid root-cause
analysis, and a unified view of your entire application ecosystem,
including private and public clouds.
About You First and foremost, you have strong troubleshooting
and problem resolution skills. You work well under pressure and
have strong written and verbal communications skills. You pride
yourself in being a self-starter who leads by example and has
experience working in a rapidly changing environment.
You also have:
Minimum of a bachelors degree in CSE, EE, CSM, or related
technical discipline; MS degree desired Minimum of a combined 5+
years of Site Reliability, DevOps, and/or Software Development
experience, ideally in a growth-stage environment Experience
operating within, and supporting, complex SaaS production or
revenue-critical 24/7 web services environments.
Must have experience developing and operationalizing system
installations and upgrades.
Strong Experience with Unix/Linux system administration especially
in RedHat Linux (CentOS) Experience running and administering
services in AWS or other cloud platforms (Azure, GCP) Significant
experience with one or more scripting/coding languages, ideally
with Terraform or Python.
Experience with big data platform engineering Experience with
scaling and operationalizing distributed data stores, file systems,
and services (Kafka, Elasticsearch, etc.); familiarity with Lambda
architecture a big plus Experience with virtualization and
containerization platforms (Docker), container orchestration tools
(Kubernetes) and aspects of Kubernetes to facilitate ease of
delivery (Istio/Helm/Kube2Iam) Availability for occasional on-call
Day-to-day responsibilities include:
Helping to build and infrastructure to facilitate rapid service
Documenting findings and recommendations for improvement
Responsible helping leads full-stack platform infrastructure
Maintaining and enhancing deployment tools and methodologies; play
a lead role in advancing our 'Infrastructure as code'
Lead the evaluation and development of our data ingestion pipeline
to be deployed 'as a service.
Creating repeatable, efficient, and scalable artifact deployment
pipelines Making recommendations to, and interfacing with
engineering to ensure 100% application uptime.
Monitor the SaaS environment and work with QA, Developers, Ops to
identify and solve problems.
Ensure that failover mechanisms are in place and are working
correctly Responding to and resolving technical emergencies
To follow up with any questions, please contact Charm Luna at
Akraya is an award-winning IT staffing firm and the staffing
partner of choice for many leading companies across the US. We
offer comprehensive benefits including Health Insurance (medical,
dental, and vision), Cafeteria Plan (HSA, FSA, and dependent care),
401(k) (enrollment subject to eligibility), and Sick Pay (varies
based on city and state laws).
If this position is not quite what you're looking for, visit
akraya.com and submit a copy of your resume. We will get to work
finding you a job that is a better fit at one of our many amazing
Akraya is committed to equal treatment and opportunity in all
aspects of recruitment, selection, and employment without regard to
gender, race, religion, national origin, ethnicity, disability,
gender identity/expression, sexual orientation, veteran or military
status, or any other category protected under the law. Akraya is an
equal opportunity employer; committed to a community of inclusion,
and an environment free from discrimination, harassment, and