speechbrain.lobes.models
Package defining neural netword models (CRDNN, Xvectors …)
A combination of Convolutional, Recurrent, and Fully-connected networks. |
|
The SpeechBrain implementation of ContextNet by https://arxiv.org/pdf/2005.03191.pdf |
|
A popular speaker recognition and diarization model. |
|
This lobes replicate the encoder first introduced in ESPNET v1 |
|
Wide ResNet for Speech Enhancement. |
|
Neural network modules for the HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis |
|
Generator and discriminator used in MetricGAN |
|
Generator and discriminator used in MetricGAN-U |
|
Implementation of a Recurrent Language Model. |
|
Neural network modules for the Tacotron2 end-to-end neural Text-to-Speech (TTS) model |
|
Vanilla Neural Network for simple tests. |
|
A popular speaker recognition and diarization model. |
|
Implementation of a popular speech separation model. |
|
This is a module to ensemble a convolution (depthwise) encoder with or without residule connection. |
|
Library to support dual-path speech separation. |
|
This lobe enables the integration of fairseq pretrained wav2vec models. |
|
Library for the Reseource-Efficient Sepformer. |
|
This file contains two PyTorch modules which together consist of the SEGAN model architecture (based on the paper: Pascual et al. https://arxiv.org/pdf/1703.09452.pdf). |
|
Components necessary to build a wav2vec 2.0 architecture following the original paper: https://arxiv.org/abs/2006.11477. |
High level processing blocks. |