AI Inference Engineer Job at Signify Technology, Fremont, CA

SFFJUExCVU5XWjlnWllqR2h5SXNHZkI3WFE9PQ==
  • Signify Technology
  • Fremont, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

Richards Building Supply

Class B CDL - Truck Driver Job at Richards Building Supply

 ...per the customer's request as well as receiving materials in the warehouse, loading trucks, and processing inventory. 1 year CDL Class B...  ...season hours. Flexible work/life balanced hours, home every night and typical schedule Monday-Friday 7:00 A.M. 4:00 P.M.... 

Meyer's RV & Marine

Title Clerk Job at Meyer's RV & Marine

 ...Description: Meyer's of Syracuse is seeking a talented, fresh, energetic individual to join our team and assist in our title office. The Title Clerk is accountable for performing the duties and responsibilities described below: Submit complete and accurate title/... 

Cox Communications

Business Development Consultant (Cox Media) Job at Cox Communications

Join the Cox Media Team as a Business Development Consultant! About Us: With nearly 30 offices across 13 states, Cox Media reaches 6 million households, connecting advertisers to audiences on multiple screens. From cable TV to cutting-edge digital products, we craft... 

TikTok

Music Product Counsel - Global Legal (San Jose) Job at TikTok

1 day ago Be among the first 25 applicants Responsibilities Responsibilities: - You will be a principal point of contact on the Global Music Legal team for various internal stakeholders, including Product and AI research teams, which are responsible for AI-related...

Free TV Networks LLC

Motion Graphic Designer Job at Free TV Networks LLC

DescriptionOverviewDesign and create smart, modern, and clean creative solutions for our FAST and national OTA networks.Responsibilities...  ...or equivalent work experience.1 - 5 years of experience of motion and graphic design.Experience with multiple dimension animation....