Find The RightJob.
ML Performance Engineer – Low Latency Inference (Trading Environment)
Location : Chicago / NYC
Experience : 3-12 years (exceptional early-career profiles considered)
If you’re working on ML today, here’s the real question:
Are you optimising models for leaderboard metrics…
Or for speed under pressure?
This role sits inside a trading business where inference latency is measured in nanoseconds. Models don’t just need to be accurate; they need to fire first.
Firms compete for a few nanoseconds of edge over the market, which can lead to millions in profit.
What You’ll Do
This is not research-only ML.
It’s ML under strict performance constraints.
You’ll Fit If
Finance experience isn’t required.
What matters is whether shaving off microseconds sounds more interesting than shipping another feature at a big lab.
If it does, it's worth a conversation.
Similar jobs
Gridiron IT
Baltimore, United States
3 days ago
Wunderlich-Malec
Tigard, United States
3 days ago
The Institute for Advanced Learning and Research
Danville, United States
3 days ago
General Motors (GM)
Warren, United States
3 days ago
Wolfe, LLC
Pittsburgh, United States
3 days ago
Tandym Group
Stamford, United States
3 days ago
General Motors (GM)
Sunnyvale, United States
3 days ago
© 2026 Qureos. All rights reserved.