Machine Learning Backend Engineer Intern

About the Job
Overview
Nexa AI is an on-device AI research and deployment company. We specialize in tiny, multimodal models (e.g. Octopus v2, OmniVLM, OmniAudio), a local on-device inference framework (e.g. nexa-sdk), and model optimization techniques (e.g. NexaQuant). Our work has been recognized by industry leaders such as Google, Hugging Face, and AMD. We partner with enterprises and SMBs to bring local intelligence to every device.
Responsibilities
- Write stable, testable infrastructure
- Diagnose and fix bugs and performance issues
- Contribute to the development of our SDKs across multiple platforms, including Windows, macOS, Android, iOS, and Linux
Qualifications
- Minimum BS/MS in Computer Science
- Excellent understanding of computer science fundamentals, including data structures, algorithms, and coding
- Knowledge of operating system internals, compilers, and low-power/mobile optimization
- Experience with low-level programming in C and with frameworks such as CUDA and OpenCL
- Proficiency in multithreading and performance optimization
- Part Time: Remote, 20+ hrs/week
- Full Time: Cupertino, California
- How to apply: *