Xiang An
Xiang An (Chinese: 安翔) is a research scientist working on computer vision and multimodal large models.
安翔,研究科学家,专注于计算机视觉与多模态大模型。
He has 17 main-conference papers across ICCV (4), CVPR (3), AAAI (3), EMNLP (2), ECCV (2), ICLR (1), NeurIPS (1), and ACM MM (1).
目前共有 17 篇顶会主会论文:ICCV 4 篇、CVPR 3 篇、AAAI 3 篇、EMNLP 2 篇、ECCV 2 篇,ICLR 1 篇、NeurIPS 1 篇、ACM MM 1 篇。
His research spans three directions:
他的研究主要涵盖三个方向:
- Distributed ML — sparse algorithms for large-scale classification; one machine handles 100M-class comparisons. Partial FC.
- Vision Encoders — next-generation ViT for modern MLLMs. OneVision-Encoder, RiceViT.
- Multimodal LLMs — fully-open multimodal training frameworks. LLaVA-OneVision-1.5, LLaVA-OneVision-2.
- 分布式机器学习 — 稀疏的分布式大规模分类与对比学习算法,一台机器搞定 1 亿规模的比对。Partial FC。
- 视觉编码器 — 面向现代 MLLM 的下一代 ViT。OneVision-Encoder、RiceViT。
- 多模态大模型 — 完全开源的多模态训练框架。LLaVA-OneVision-1.5、LLaVA-OneVision-2。
For a complete list of publications, see All Publications.
完整论文列表请参见所有发表论文。
Publications §
发表论文 §
The following is a selection of notable publications. For a complete list, see All Publications.
以下为代表性论文精选。完整列表请参见所有发表论文。
Awards & Competitions §
荣誉与竞赛 §
- ICCV 2025 Outstanding Reviewer
- CVPR 2024 Outstanding Reviewer
- Ranked 1st in NIST FRVT Competition, Visa Track 1:1
- 2024 中国年度力量人物提名
- Ranked 1st in the graduate entrance examination (major)
- First Place in Vehicle Re-Identification, PRCV 2019
- ICCV 2025 杰出审稿人
- CVPR 2024 杰出审稿人
- NIST FRVT 竞赛 Visa Track 1:1 第一名
- 2024 中国年度力量人物提名
- 研究生入学考试(专业课)第一名
- PRCV 2019 车辆重识别第一名
Open Source §
开源项目 §
- Open Source Library
- Multimodal LLM Framework
- Vision Encoder
- Image Retrieval Framework
- Large Multimodal Model
- Educational Project
Citation Map §
引用地图 §
City-level citing-author locations generated offline from Semantic Scholar + OpenAlex.
引用作者的城市级地理分布,通过 Semantic Scholar + OpenAlex 离线生成。
This page is styled after Wikipedia.
本页面样式参考自维基百科。