Skip to content
View m-usamasaleem's full-sized avatar

Block or report m-usamasaleem

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
m-usamasaleem/README.md

Hi there πŸ‘‹

I am a Ph.D. candidate in Computer Science at the University of North Carolina at Charlotte, supervised by Dr. Pu Wang in the GENIUS Lab. In industry, I work as a researcher with the Computer Vision teams at Amazon and Lowe’s where I am developing large-scale, multimodal language models (MLLMs) to enhance operational efficiency and customer experience in complex, real-world environments. Moreover, I joined Google as a Student Researcher in the Extended Reality (AR/VR) team, working on advancing multimodal and generative AI for immersive technologies.

Research Interests

My research interests lie at the intersection of computer vision and generative AI, with a focus on 3D human modeling. Specifically, I focus on 3D human pose estimation and mesh reconstruction via generative masked modeling. Moreover, I’m interested in developing multimodal motion synthesis frameworks that synthesize controllable, high-fidelity 3D human animations for real time applications.

Contact Me

If you have any research opportunities, please feel free to reach out:

πŸ‘¨πŸ»β€πŸ’» Technologies:

Technologies

⚑ I like to play cricket, I enjoy cooking 🎨.

Get in touch!

πŸ“§ Email: [email protected]
πŸ‘¨πŸ»β€πŸ’Ό LinkedIn: m-usamasaleem

Pinned Loading

  1. MaskControl MaskControl Public

    Forked from exitudio/MaskControl

    Official repository for "MaskControl: Spatio-Temporal Control for Masked Motion Synthesis" ICCV 2025 (Oral & Award Candidate)

    Python 1

  2. BAMM BAMM Public

    Forked from exitudio/BAMM

    Official repository for "BAMM: Bidirectional Autoregressive Motion Model (ECCV 2024)"

    Python

  3. DP-Shield-EDBT-22 DP-Shield-EDBT-22 Public

    JavaScript 5

  4. BingeIT BingeIT Public

    BingeIt is a movie and video streaming app to entertain users. It will provide access to various categories of movies to users to make the user's experience better. Users will have control over the…

    JavaScript

  5. Spotizer-Flask Spotizer-Flask Public

    Python

  6. MobileMechanic MobileMechanic Public

    JavaScript