Skip to content

sodonne6/EEP55C34_Advanced_AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

81 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EEP55C34 Advanced AI

ASL to Speech - Group Project

Develop real-time application that translates American Sign Language (ASL) into spoken and written language, enabling seamless communication between deaf and hearing individuals.

Pipeline

Original Architecture

The original approach was inspired by SignFormer-GCN

Architecture Diagram

New Approach

The approach taken leans heavier on GCN influence by adding two more GCN paths which process the right/left hand coordinates extracted by mediapipe

New Architecture Diagram

Datasets

How2Sign

Team Members

Farida Shittu & Shane O'Donnell

Installation

Clone repo locally or in colab

git clone --recurse-submodules https://github.com/sodonne6/EEP55C34_Advanced_AI.git
cd \inside\repo\root

Clone Submodules

git submodule update --init --recursive

Repository guide

This repository contains both the runtime demo code and the training framework used during development.

  • See src/README.md for environment setup and instructions on running the application end-to-end.
  • See SLT/external/README.md for training-related notes, including the original SignFormer GCN reference repo, required override files, and the training launch workflow.

Recommended starting points:

  • Running the system: src/README.md
  • Training a new model: SLT/external/README.md

References

[1] S. H. Arib, R. Akter, S. Rahman, and S. Rahman, "SignFormer-GCN: Continuous sign language translation using spatio-temporal graph convolutional networks," PLoS ONE, vol. 20, no. 2, p. e0316298, Feb. 2025, doi: 10.1371/journal.pone.0316298.

[2] A. Duarte et al., “How2Sign: a large-scale multimodal dataset for continuous American sign language,” arXiv.org, Aug. 18, 2020. https://arxiv.org/abs/2008.08143

About

Continuous ASL-to-speech pipeline combining SignFormer-GCN, MediaPipe pose features, and speech synthesis.

Topics

Resources

Stars

Watchers

Forks

Contributors

Languages