We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Software Engineer (Windows)

Lenovo
United States, North Carolina, Morrisville
Sep 26, 2025


General Information
Req #
WD00088681
Career area:
Hardware Engineering
Country/Region:
United States of America
State:
North Carolina
City:
Morrisville
Date:
Friday, September 26, 2025
Working time:
Full-time
Additional Locations:
* United States of America - North Carolina - Morrisville

Why Work at Lenovo
We are Lenovo. We do what we say. We own what we do. We WOW our customers.
Lenovo is a US$69 billion revenue global technology powerhouse, ranked #196 in the Fortune Global 500, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver Smarter Technology for All, Lenovo has built on its success as the world's largest PC company with a full-stack portfolio of AI-enabled, AI-ready, and AI-optimized devices (PCs, workstations, smartphones, tablets), infrastructure (server, storage, edge, high performance computing and software defined infrastructure), software, solutions, and services. Lenovo's continued investment in world-changing innovation is building a more equitable, trustworthy, and smarter future for everyone, everywhere. Lenovo is listed on the Hong Kong stock exchange under Lenovo Group Limited (HKSE: 992) (ADR: LNVGY).
This transformation together with Lenovo's world-changing innovation is building a more inclusive, trustworthy, and smarter future for everyone, everywhere. To find out more visit www.lenovo.com, and read about the latest news via our StoryHub.
The Advanced AI Technology Center (AAITC)-Lenovo's global AI Center of Excellence-is driving our transformation into an AI-first organization. We are assembling a world-class team of researchers, engineers, and innovators to position Lenovo and its customers at the forefront of the generational shift toward AI.
Lenovo is one of the world's leading computing companies, delivering products across the entire technology spectrum, spanning wearables, smartphones (Motorola), laptops (ThinkPad, Yoga), PCs, workstations, servers, and services/solutions. This unmatched breadth gives us a unique canvas for AI innovation, including the ability to rapidly deploy cutting-edge foundation models and to enable flexible, hybrid-cloud, and agentic computing across our full product portfolio.
To this end, we are building the next wave of AI core technologies and platforms that leverage and evolve with the fast-moving AI ecosystem, including novel model and agentic orchestration & collaboration across mobile, edge, and cloud resources.
This space is evolving fast and so are we. If you're ready to shape AI at a truly global scale, with products that touch every corner of life and work, there's no better time to join us.

Description and Requirements

Lenovo is seeking an experienced Software Engineer to lead the integration and implementation of Small Language Model (SLM) inferencing for our next-generation AI systems onto our Windows laptop and desktop computers. If you are passionate about making Smarter Technology For All, come help us realize our Hybrid AI vision!

Responsibilities:

  • Design, implement, and maintain core agent runtimes responsible for:

  • Dynamic model loading and lifecycle management

  • Scheduling, prioritization, and queuing of model inference requests

  • Information retrieval, data preprocessing, and context preparation pipelines

  • Develop system integrations to support interoperability between Windows applications, services, and AI runtime components

  • Implement security and privacy controls, including process isolation, sandboxing, audit logging, and compliance with enterprise-grade software security standards

  • Optimize runtime performance for latency, throughput, and memory footprint across heterogeneous compute platforms (CPU, GPU, NPU) and across various vendor AI Frameworks (such as OpenVino, RyzenAI, and QNN)

  • Embed safety and interpretability features into the stack, including guardrails, telemetry, and explainability mechanisms

  • Contribute to platform reliability, including fault tolerance, error handling, and automated recovery strategies

  • Collaborate with cross-functional teams (AI researchers, product managers, QA, and DevOps) to deliver robust, production-ready solutions

  • Mentor junior engineers by providing technical guidance, conducting code reviews, and supporting onboarding

  • Author technical documentation and best practices, contributing to internal knowledge bases, design proposals, and architecture reviews

Required Qualifications:

  • Expertise in Windows development, including Win32 APIs, DLL development, and system-level programming

  • Strong understanding of Windows software security best practices (e.g., secure coding, privilege management, sandboxing)

  • Experience with GGML, GGUF, and llama.cpp for local model inference

  • Proficiency in C/C++ for client and systems development

Preferred Qualifications:
  • Familiarity with ONNX, OpenVino, RyzenAI, and QNN runtimes

  • Working knowledge of Python and deep learning frameworks such as PyTorch

  • Strong problem-solving and debugging skills in complex, multi-threaded environments

  • Experience with Kotlin Multiplatform (KMP) or other cross-platform development frameworks

  • Contributions to open-source AI runtimes, libraries, or Windows utilities

  • Background in performance optimization, compiler toolchains, or low-level hardware acceleration

#AAITC

We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, national origin, status as a veteran, and basis of disability or any federal, state, or local protected class.
Additional Locations:
* United States of America - North Carolina - Morrisville
* United States of America
* United States of America - North Carolina
* United States of America - North Carolina - Morrisville

Applied = 0

(web-759df7d4f5-7gbf2)