Job Details

Site Reliability Engineer, AI/ML Platforms

  2025-05-06     Adobe     Airway Heights,WA  
Description:

JOB LEVEL

P40

ADDITIONAL JOB LEVELS

P50

EMPLOYEE ROLE

Individual Contributor

The Opportunity

We're looking for an outstanding Site Reliability Engineer for Adobe's AI Training and Inference Platforms within Adobe Firefly. You will be part of a team of Site Reliability Engineers closely working with the Engineering teams on building, scaling, and securing the AI Platform. This enables the Firefly product teams to easily manage and deploy Machine Learning capabilities used by Adobe client applications.

The Applied Research groups from Adobe Research and other App Teams in Adobe will deploy thousands of models onto this platform in a variety of lifecycle stages (early research, development, productization, optimization, etc). This platform will offer ML model training and serving at scale, with high-cost efficiency, and on a wide variety of hardware platforms across multiple clouds.

What You'll Do

  • Identify and implement methodologies and solutions to increase reliability, scalability, security, and efficiency.
  • Ensure the highest uptime and Quality of Service (QoS) for Adobe's customers through operational excellence.
  • Define service level objectives (SLOs) and indicators (SLIs) to represent and measure service quality.
  • Support and maintain globally distributed, multi-cloud (public and/or private) environments.
  • Automate common, repeatable tasks at a large scale to streamline operational procedures.
  • Identify areas to improve service resiliency through techniques such as chaos engineering, performance/load testing, etc.
  • Coordinate with other Adobe platform teams and service providers (primarily AWS) to innovate on Generative AI as a Service.

What You'll Need to Succeed

  • A Bachelor's or Master's degree in Computer Science, Electrical Engineering, a related field, and 5+ years relevant industry experience.
  • You excel in undefined environments and get excited about finding pragmatic solutions to complex technical or organizational challenges.
  • You keep up with the industry trends and grow your knowledge and skills to solve technical problems.
  • Experience in building and scaling distributed systems, as well as experience with containerization and orchestration technologies like Kubernetes.
  • Production level expertise with containerization orchestration engines (e.g. Kubernetes) and proven understanding of modern, continuous development techniques and pipelines (IaC, CI/CD, ArgoCD, Git).
  • Fundamental programming skills, ideally practical experience in one (and preferably more) of the following languages: Python, Go.
  • Good knowledge of infrastructure configuration management tools like Ansible and Terraform.
  • Experience in using observability and tracing-related tools like InfluxDB, Prometheus, and Elastic Stack.
  • An understanding of AI/ML, including ML frameworks, public cloud, and commercial AI/ML solutions - familiarity with Pytorch, SageMaker, HuggingFace, NVIDIA TensorRT or OpenAI Triton a plus.

Application Window Notice

There is no deadline to apply to this job posting because Adobe accepts applications for this role on an ongoing basis. The posting will remain open based on hiring needs and position availability.

Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this position is $133,900 -- $242,000 annually. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process.

Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and “fair chance” ordinances.

Adobe is an equal opportunity and affirmative action employer. We welcome and encourage diversity in the workplace regardless of gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other characteristics protected by law.

If you have a disability or special need that requires accommodation to navigate our internal careers site or to complete the application process, please contact ...@adobe.com.

#J-18808-Ljbffr


Apply for this Job

Please use the APPLY HERE link below to view additional details and application instructions.

Apply Here

Back to Search