Audioperm
A python library for generating different permutations of audible segments from audio files.
SourceByakto TTS [bangla text to speech]
Multilingual (Bangla, English) real-time speech synthesis library in Python. [75 GitHub ⭐]
SourceCLIP (Contrastive Language–Image Pre-training) for Bangla
The model consists of an EfficientNet / ResNet image encoder and a BERT text encoder and was trained on multiple datasets from Bangla image-text domain.
SourceSarcasm Detection using RNN variants
Sarcasm Detection using LSTM, GRU, and RoBERTa on SARC (reddit), sarcasm_v2, and iSARCASM (twitter) datasets.
SourceCovid-19 few shot learning from X-ray images
Few-Shot Learning with Siamese Network for Covid-19 X-ray images.
SourceKeras Human Pose
A simple wrapper (Keras) to localize human joints from images/video frames for multiple subjects.
SourcePyTorch Speech Dataloader (torch-speech-dataloader)
A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations.
Source