Back to Projects
ASL Recognition
CompletedPythonTensorFlowMediaPipe+2 more

ASL Recognition

Real-time bidirectional ASL alphabet recognition using BiLSTM-Attention network with confidence-gated decoding.

Timeline

2 months

Role

ML Researcher

Team

Solo

Status
Completed

Technology Stack

Python
TensorFlow
MediaPipe
BiLSTM
OpenCV

Key Challenges

  • Real-time inference
  • Dataset curation
  • Attention mechanism
  • Confidence-gated decoding

Key Learnings

  • BiLSTM architectures
  • Attention networks
  • MediaPipe integration
  • Research paper writing

ASL Alphabet Recognition System

Overview

A real-time ASL fingerspelling recognition pipeline using hand landmarks and a BiLSTM + Attention model with confidence-gated decoding, published as a TechRxiv preprint.

Research Highlights

  • Published: TechRxiv preprint (2026)
  • Validation Accuracy: 99.4%
  • Top-3 Accuracy: 100%
  • Real-time Performance: ~30 FPS

Dataset

  • Custom dataset of ~2,500 sequences
  • Over 75,000 frames across 26 ASL classes
  • Curated and annotated for research quality

Technical Architecture

  • Feature Extraction: MediaPipe for 63-dimensional hand landmarks
  • Model: 1.36M parameter BiLSTM-Attention network
  • Decoding: Confidence-gated decoding for robust inference

Tech Stack

  • Python for development
  • TensorFlow for deep learning
  • MediaPipe for hand tracking
  • BiLSTM for sequence modeling
  • OpenCV for video processing

Key Contributions

  • Built real-time inference pipeline from webcam video
  • Emphasized robustness for real-world deployment
  • Created comprehensive research documentation

Design & Developed by Akhil Chava
© 2026. All rights reserved.