Zabir's quests

Audioperm

A python library for generating different permutations of audible segments from audio files.

Byakto TTS [bangla text to speech]

Multilingual (Bangla, English) real-time speech synthesis library in Python. [75 GitHub ⭐]

CLIP (Contrastive Language–Image Pre-training) for Bangla

The model consists of an EfficientNet / ResNet image encoder and a BERT text encoder and was trained on multiple datasets from Bangla image-text domain.

Source

Sarcasm Detection using RNN variants

Sarcasm Detection using LSTM, GRU, and RoBERTa on SARC (reddit), sarcasm_v2, and iSARCASM (twitter) datasets.

Source

Covid-19 few shot learning from X-ray images

Few-Shot Learning with Siamese Network for Covid-19 X-ray images.

Source

Keras Human Pose

A simple wrapper (Keras) to localize human joints from images/video frames for multiple subjects.

Source

PyTorch Speech Dataloader (torch-speech-dataloader)

A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations.

Source

Qt Motion Analysis

Pose estimation with descriptive analysis.

Source

Code