更新于 12月17日

Model Inference Engineer (PC & Android)

面议
  • 北京海淀区
  • 5-10年
  • 硕士
  • 全职
  • 招1人

职位描述

大模型
We are looking for a senior-level engineer to focus on high-performance model inference across PC and Android platforms. The role centers on optimizing LLM/multimodal models for low latency and efficient memory use, implementing C++ runtimes, applying advanced acceleration techniques, and collaborating closely with research teams to bring optimized inference solutions into production environments.
Key Responsibilities
• Design and implement optimized model inference pipelines for PC (x86/AMD/Intel) and Android (ARM).
• Apply quantization, operator/kernal fusion, memory optimization, and runtime scheduling techniques.
• Work with at least one major inference stack: llama.cpp, Qualcomm AI SDKs (QNN/QAIRT/QSDK) or MTK Neuro Pillot; better to have experience with OpenVINO, Ryzen AI, and other inference SDKs.
• Profile and tune CPU/GPU/NPU performance using industry-standard profiling tools.
• Collaborate with model researchers to translate new methods into efficient runtime implementations.
Required Qualifications
• Master’s degree or above, with 3+ years experience in model inference, runtime engineering, or performance optimization.
• Strong C++ programming skills; familiarity with Android NDK/JNI is a plus.
• Solid understanding of transformer architectures, inference mechanisms, and acceleration methods.
• Hands-on experience with at least one of: llama.cpp, Qualcomm AI SDKs (QNN/QAIRT/QSDK) or MTK Neuro Pillot; better to have experience with OpenVINO, Ryzen AI, and other inference SDKs.
• Ability to read English technical papers and documentation; English communication preferred.
Preferred
• Experience with ONNX Runtime, TVM, XNNPack, or mobile performance tools.
• Contributions to open-source inference or optimization frameworks.

工作地点

北京市海淀区西北旺东路10号联想全球总部

职位发布者

方女士/招聘PMO

当前在线
公司Logo联想(北京)有限公司公司标签
联想(HKSE: 992)(ADR: LNVGY)是一家年收入700亿美元的全球化科技公司,位列《财富》世界500强第159名,在世界各地共有75,000名员工,服务遍布全球180个市场数以百万计的客户。为实现“智能,为每一个可能”的公司愿景,我们在不断夯实个人电脑全球市场冠军地位的基础上,更进军基础设施、手机、解决方案和服务等新的增长领域。凭借坚定执行智能化转型战略和持续开发改变世界的创新与技术,我们正在为世界各地的亿万消费者打造一个更加包容、值得信赖和可持续发展的数字化未来。欢迎访问联想官方网站 https://www.lenovo.com,并关注“联想集团”微博及微信公众号等社交媒体官方账号,获取联想最新动态。面向新一轮智能化变革的产业升级契机,联想提出智能化变革战略,围绕智能物联网(Smart IoT)、智能基础架构(Smart Infrastructure)、行业智能(Smart Verticals)三个方向,立志成为行业智能化变革的引领者和赋能者。2020/2021财年,联想进一步扩展和提升服务业务,以服务和解决方案为导向推动转型的深入,力争在未来十年内将服务和解决方案打造成联想新的核心竞争力。目前,联想核心业务由三大业务集团组成,分别为专注智能物联网的IDG智能设备业务集团、专注智能基础设施的ISG基础设施方案业务集团及专注行业智能与服务的SSG方案服务业务集团。联想集团致力于通过持续创新、卓越运营和全球布局,推进业务的可持续发展,实现基业长青。
公司主页