zhousheng97

🐢

Focusing

Sheng Zhou zhousheng97

Postdoc.

18 followers · 15 following

King Abdullah University of Science and Technology
Saudi Arabia
https://zhousheng97.github.io/

zhousheng97/README.md

Hi there 👋

I’m Sheng.
My focus is multimodal learning, especially VQA, and I’m currently exploring multimodal LLMs.
💬 As an ENFJ-A, I thrive on meaningful collaboration and communication.
📫 You can reach me at [email protected]—let’s connect!

Pinned

EgoTextVQA (Public)

[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

Python · 47 stars · 1 fork

ViTXT-GQA (Public)

[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering

Python · 17 stars

GPIN (Public)

[ACM TOMM'24] Graph Pooling Inference Network for Text-based VQA

Python · 3 stars

SSGN (Public)

[IEEE TIP'23] Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA

Python · 4 stars