Muhammad Usama Saleem m-usamasaleem

Hi there 👋

I am a Ph.D. candidate in Computer Science at the University of North Carolina at Charlotte, supervised by Dr. Pu Wang in the GENIUS Lab. In industry, I work as a researcher with the Computer Vision teams at Amazon and Lowe’s, where I am developing large-scale multimodal language models (MLLMs) to improve operational efficiency and customer experience in complex, real-world environments. I also joined Google as a Student Researcher on the Extended Reality (AR/VR) team, working on multimodal and generative AI for immersive technologies.

Research Interests

My research lies at the intersection of computer vision and generative AI, with a focus on 3D human modeling. Specifically, I work on 3D human pose estimation and mesh reconstruction via generative masked modeling. I am also interested in developing multimodal motion synthesis frameworks that generate controllable, high-fidelity 3D human animations for real-time applications.

Contact Me

If you have any research opportunities, please feel free to reach out:

Email: [email protected]

👨🏻‍💻 Technologies:

⚡ I like to play cricket, and I enjoy cooking 🎨.

Get in touch!

📧 Email: [email protected]
👨🏻‍💼 LinkedIn: m-usamasaleem
