Job Description
We are looking for a senior-level engineer to focus on high-performance model inference across PC and Android platforms. The role centers on optimizing LLM/multimodal models for low latency and efficient memory use, implementing C++ runtimes, applying advanced acceleration techniques, and collaborating closely with research teams to bring optimized inference solutions into production environments.
Key Responsibilities
• Design and implement optimized model inference pipelines for PC (x86, Intel/AMD) and Android (ARM).
• Apply quantization, operator/kernel fusion, memory optimization, and runtime scheduling techniques.
• Work with at least one major inference stack: llama.cpp, Qualcomm AI SDKs (QNN/QAIRT/QSDK), or MTK NeuroPilot.
• Profile and tune CPU/GPU/NPU performance using industry-standard profiling tools.
• Collaborate with model researchers to translate new methods into efficient runtime implementations.
Required Qualifications
• Master’s degree or above, with 3+ years of experience in model inference, runtime engineering, or performance optimization.
• Strong C++ programming skills; familiarity with Android NDK/JNI is a plus.
• Solid understanding of transformer architectures, inference mechanisms, and acceleration methods.
• Hands-on experience with at least one of: llama.cpp, Qualcomm AI SDKs (QNN/QAIRT/QSDK), or MTK NeuroPilot; experience with OpenVINO, Ryzen AI, or other inference SDKs is a plus.
• Ability to read technical papers and documentation in English; strong English communication skills preferred.
Preferred Qualifications
• Experience with ONNX Runtime, TVM, XNNPACK, or mobile performance tools.
• Contributions to open-source inference or optimization frameworks.