- I’m Sheng.
- My focus is multimodal learning, especially VQA, and I’m currently exploring multimodal LLMs.
- 💬 As an ENFJ-A, I thrive on meaningful collaboration and communication.
- 📫 You can reach me at [email protected]—let’s connect!
- Postdoc, King Abdullah University of Science and Technology
- Saudi Arabia
- https://zhousheng97.github.io/
Pinned
- EgoTextVQA: [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
