Yuxiong He

Distinguished AI Software Engineer, Snowflake

Yuxiong He is a Distinguished AI Engineer at Snowflake, spearheading the development and research of Large Language Models (LLMs). As a pivotal co-leader of the Arctic project, she collaborates with a team of exceptional AI professionals to develop the Snowflake suite of foundational models. Her dedication to innovation is matched by her commitment to open source and open research, striving to build transformative and high-performing AI technologies. Previously, Yuxiong held the position of Partner Research and Product Manager at Microsoft, where she co-founded and led the DeepSpeed project. This industry-leading, open-source deep learning optimization library introduced groundbreaking innovations like ZeRO, 3D parallelism, and ZeroQuant. These advancements have significantly accelerated and democratized the training and inference processes of cutting-edge LLMs, making them more accessible to everyone in need. Yuxiong has published over 100 papers in major computer science conferences and journals. Her work has been recognized among the best papers at esteemed venues such as SIGIR, ICDE, WSDM, and Middleware, and her research continues to be widely applied in diverse systems and products.

Gen AI

Inside Snowflake Intelligence: Five Pillars of Enterprise-Grade Agentic AI

Explore the underlying architecture, orchestration, and system-level optimizations behind Snowflake Intelligence, a production-grade agentic AI system built for enterprise reasoning.

Yuxiong He

Inside Snowflake Intelligence: Five Pillars of Enterprise-Grade Agentic AI

MORE POSTSFROM Yuxiong He

Smaller Models, Smarter SQL: Arctic-Text2SQL-R1 Tops BIRD and Wins Broadly

Arctic Inference with Shift Parallelism: The Fastest Open Source Inference System for Enterprise AI

Scaling vLLM for Embeddings: 16x Throughput and Cost Reduction

Low-Latency and High-Throughput Inference for Long Context with Sequence Parallelism (aka Arctic Ulysses)

Think. Execute. Excel: Arctic Text2SQL with Execution-Guided CoT

Introducing Arctic Agentic RAG: Smarter, Faster and More Reliable AI for Enterprise

SwiftKV from Snowflake AI Research Reduces Inference Costs of Meta Llama LLMs up to 75% on Cortex AI

Where Data Does More