Hiring.Camp

Engineer, Inference & Model serving

techire ai

Apr 30, 2026

San Francisco, CARemote, OnsiteFull-timeEngineering$220k – $320k

Skills & Technologies

PythonGoKubernetesPyTorch

Ready to apply?

Opens techire ai's career page

Apply Now

Similar Jobs

30

Inference Engineer

San Francisco, CA

Inference Engineer

*HQ - San Francisco, CA

DevOps Engineer (AI Inference)

Poland, Serbia, Cyprus, Georgia

DevOps Engineer (AI Inference)

Poland, Serbia, Cyprus, Georgia

Principal LLM Inference Engineer

Santa Clara

Machine Learning Engineer, Causal Inference, Level 5

Santa Monica - 2850 Ocean Park Blvd, United States of America +3 more

Staff + Senior Software Engineer, Inference Deployment

San Francisco, CA | New York City, NY | Seattle, WA +1 more

Staff AI Inference and Acceleration Engineer

San Jose, CA +1 more

Software Engineer, Inference Platform

Sunnyvale, CA

Staff Software Engineer, Inference Platform

Sunnyvale, CA

Application Software Engineer, Inference

Palo Alto, CA

Inference Optimization Engineer (local / edge runtime)

USA - CA - Santa Clara, United States of America +3 more

Systems Software Engineer (Rust, ML Inference)

Germany

Systems Software Engineer (Rust, ML Inference)

Germany

AI Computing Software Development Engineer, LLM Inference

China, Shanghai +1 more

AI Computing Software Development Engineer, LLM Inference

Shanghai, Shanghai,CN, CN +1 more

Staff Software Engineer, Machine Learning Inference Platform

Pittsburgh, PA or Remote

Senior Software Engineer, Machine Learning Inference Platform

Pittsburgh, PA or Remote

Staff+ Software Engineer, Inference Runtime

Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY +1 more

Senior Inference Engineer, AIConfigurator for Dynamo

US, CA, Santa Clara, United States of America +1 more

Senior Inference Engineer, AIConfigurator for Dynamo

Santa Clara, CA,US, US +1 more

Senior Security Engineer - Akamai Inference Cloud - Remote/Poland

Poland, PL

Staff + Senior Software Engineer, Inference

San Francisco, CA | New York City, NY | Seattle, WA +1 more

Machine Learning Engineer II - Autonomous Driving & Inference Runtime

Anywhere, USA

Staff + Sr. Software Engineer, Cloud Inference Launch Engineering

San Francisco, CA +1 more

Staff + Sr. Software Engineer, Cloud Inference

San Francisco, CA +1 more

Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)

San Jose, CA, United States of America +4 more

Software Engineer- BIS (Baseten Inference Stack)

San Francisco

Lead AI Engineer (FM Hosting, LLM Inference)

New York, NY, United States of America +3 more

Lead AI Engineer (FM Hosting, LLM Inference)

New York, NY, United States of America +3 more