Zhihui Li

E-Mail:

Administrative Position:无

Degree:Dr

Discipline: Computer Science and Technology

Research Focus

Current position: Home > Research Focus

Multimodal Foundation Models

Hits:

Developing efficient and robust multimodal foundation models that integrate vision, language, speech, and actions. Research topics include cross-modal representation learning, alignment, reasoning, and generation, with an emphasis on open-world generalization and knowledge-enhanced modeling for diverse downstream tasks.