更新于 12月28日

Data Scientist - LLM Applications

1.5-3万
  • 上海闵行区
  • 1-3年
  • 硕士
  • 全职
  • 招1人

职位描述

LLMSLIAMAQWEN微调大语言RAG英文流利法语
Data Scientist - LLM Applications
RESPONSIBILITIES
The objective of this position is to drive innovation in projects focused on the development and
application of data analytics tools to support our engineering (e.g., ASU engineering) and
operations (e.g., ASU, SMR, Electronics Carry Gas). Focusing on the business needs of Air
Liquide in the context of our customer-centric transformation, the researcher will be responsible
to:
- Collaborate with business entities to define the problem and business requirements,
translate them into functional design specifications, and develop solutions. Identify,
evaluate and select industrial or academic partners as needed.
- Lead and participate in the design, development, and deployment of AI solutions based
on Large Language Models (LLMs) to address key challenges in industrial, R&D, and
healthcare domains.
- Lead the fine-tuning and optimization of open-source LLMs (e.g., Llama, Qwen,
DeepSeek) for specific business scenarios (such as technical document comprehension,
process parameter optimization, safety report analysis, and scientific knowledge mining).
- Expertly apply Retrieval-Augmented Generation (RAG) techniques, integrating internal
knowledge bases (e.g., technical patents, engineering manuals, research reports) with
external data to build high-accuracy intelligent Q&A, content generation, and knowledge
management systems.
- Test and verify the performance of solutions with prototypes developed.
- Define and develop business tools based upon the prototype performance verification,
ensure transfer of the tool to the operational entities and provide support for the
industrial deployment.
- Train team members on the details of the implemented methodology, thus ensuring
sustainability of the solution for Air Liquide.
- Support knowledge transfer within Air Liquide. Publish research in internal R&D reports,
at conferences and potentially in peer-reviewed journals.
- Work with IT, internal, and external organizations to obtain, clean, visualize, and analyze
data.
- Continuously track the latest advancements in NLP, LLM, and Generative AI (GenAI)
(e.g., Agents, Multi-modality), evaluating and introducing new technologies to enhance
team capabilities.
EXPECTED BACKGROUNDS
- M.S. or Ph.D. in Computer Science, Artificial Intelligence, Statistics, Mathematics,
Engineering or related fields. Independent and inter-disciplinary research experience are
preferred.
- Solid, practical experience in LLM fine-tuning with a deep understanding of its principles.
- In-depth understanding of RAG architecture with at least one complete, deployed RAG
project. Familiarity with relevant frameworks.
- Excellent fundamental understanding of statistics (e.g. distributions, probability, linear
regressions) is a must. Knowledge of advanced statistics (e.g. clustering, elastic net,
MLE, dimension reduction (PCA, PLS, etc), stochastic process, bayesian network, time
series models) and machine learning models (e.g. decision trees, random forest, SVM)
are of benefit.
- Programming experience with R and Python are preferred. Knowledge of Java, C++, or
Javascript is also of benefit.
- Excellent communication and interpersonal skills (written and oral). Must be comfortable
to work in English on a daily basis and in a multi-disciplinary and international team.
Knowledge of French is of benefit.
PREFERRED BACKGROUNDS
- Project experience in industrial manufacturing (e.g., chemical, energy), semiconductors,
healthcare, or supply chain is preferred.
- Familiarity with the selection, deployment, and optimization of vector databases (e.g.,
Milvus, Pinecone, Chroma).
- Familiarity with AI services and tools on at least one cloud platform (AWS, Azure).
- Experience with AIGC, multi-modal models, or AI Agent development.
- MLOps experience (model deployment and serving), familiar with tools like Docker,
Kubernetes, FastAPI/Gradio.
- Self motivated individual with ability to define and solve problems in collaborative ways
across teams from different backgrounds.
- Publications in top-tier AI/NLP conferences or journals are a plus.
LOCATION
Shanghai, China
ABOUT AIR LIQUIDE
A world leader in gases, technologies and services for Industry and Health, Air Liquide is
present in about 80 countries with approximately 68,000 employees and serves more than 3
million customers and patients. Oxygen, nitrogen and hydrogen are essential small molecules
for life, matter and energy. They embody Air Liquide’s scientific territory and have been at the
core of the company’s activities since its creation in 1902.

工作地点

闵行区液化空气上海研发与技术中心

职位发布者

杜女士/招聘

昨日活跃
立即沟通
公司Logo液化空气(中国)投资有限公司公司标签
液化空气集团液化空气集团——全球工业与医疗保健领域气体、技术和服务的领导者之一,业务遍及80个国家,员工约65,000人,为超过300万名客户与患者提供服务。氧气、氮气和氢气是生命、物质及能源不可或缺的小分子。它们象征着液化空气的科学疆域,自集团1902年成立以来,始终位于其业务的核心。液化空气集团的宏伟目标是领导所在行业,实现长期业绩,并致力于可持续发展。公司以客户为中心的转型战略旨在实现长期盈利性增长。这依托于全球范围内的卓越运营、选择性投资、开放式创新和网络化组织。通过员工的全心投入与不断创新,液化空气集团利用能源与环境转型、医疗保健及数字化领域的变革,为所有利益相关方创造更多价值。2016年,液化空气集团的销售额达181亿欧元,其中保护生命与环境的解决方案所占销售份额超过40%。液化空气集团在巴黎泛欧证券交易市场上市(A类),同时是法国指股CAC 40指数、欧元区斯托克50指数及富时社会责任指数成员企业。液化空气集团在中国液化空气集团早在1916年就进入中国,70年代开始向中国提供空分设备,经过近十多年业务的稳步发展,目前在中国设有近90家工厂,遍布40多个城市,拥有逾4000名员工。集团在华主要经营范围包括工业及医用气体的运营,工程与制造业务,以及先进事业技术部和上海研发与技术中心从事的创新业务。公司业务已覆盖中国主要的沿海工业区域,并继续向中部、南部和西部地区拓展。液化空气通过创造卓越绩效和履行责任追求盈利性增长和长期可持续发展,并保持在中国的行业领先地位。依托于集团的长期战略与全球资源,公司聚焦能源、环境、高科技和医疗保健等领域,以迎接挑战并创造新的市场机遇。凭借专业团队的全力支持,公司致力于为客户提供可信赖的服务与高附加值解决方案,同时履行企业社会责任。请访问以下链接以了解更多信息:www.airliquide.comwww.airliquide.com/cn/chinahttp://e.weibo.com/airliquidechina
公司主页