end2end-asr-pytorch

冷不防 2023-02-15 13:48 31阅读 0赞

end2end-asr-pytorch

https://github.com/gentaiscool/end2end-asr-pytorch

End-to-End Automatic Speech Recognition on PyTorch.
End-to-End Speech Recognition on Pytorch.
Transformer-based Speech Recognition Model.

  1. end-to-endadj. 端到端的,端点对端点的 n. 不断地
  2. automatic speech recognitionASR:自动语音识别
  3. text to speechTTS:从文本到语音
  4. speech to textSTT:从语音到文本

PyTorch
https://pytorch.org/

torchaudio: an audio library for PyTorch
https://github.com/pytorch/audio

1. pytorch==1.4.0 torchaudio==0.4.0 torchvision==0.50

1.1 get started

  1. (base) yongqiang@yongqiang:~$ conda create -n pt-1.4_py-3.6 python=3.6
  2. ......
  3. # To activate this environment, use
  4. #
  5. # $ conda activate pt-1.4_py-3.6
  6. #
  7. # To deactivate an active environment, use
  8. #
  9. # $ conda deactivate
  10. (base) yongqiang@yongqiang:~$
  11. (base) yongqiang@yongqiang:~$ conda activate pt-1.4_py-3.6
  12. (pt-1.4_py-3.6) yongqiang@yongqiang:~$
  13. conda install pytorch torchvision cpuonly -c pytorch
  14. # CUDA 9.2
  15. conda install pytorch==1.2.0 torchvision==0.4.0 cudatoolkit=9.2 -c pytorch
  16. # CUDA 10.0
  17. conda install pytorch==1.2.0 torchvision==0.4.0 cudatoolkit=10.0 -c pytorch
  18. # CPU Only
  19. conda install pytorch==1.2.0 torchvision==0.4.0 cpuonly -c pytorch
  • conda install pytorch==1.4.0 torchvision torchaudio==0.4.0 cpuonly -c pytorch

    (pt-1.4_py-3.6) yongqiang@yongqiang:~$ conda install pytorch==1.4.0 torchvision torchaudio==0.4.0 cpuonly -c pytorch
    ……

    Package Plan

    environment location: /home/yongqiang/miniconda3/envs/pt-1.4_py-3.6

    added / updated specs:

    1. - cpuonly
    2. - pytorch==1.4.0
    3. - torchaudio==0.4.0
    4. - torchvision

    ……
    Preparing transaction: done
    Verifying transaction: done
    Executing transaction: done
    (pt-1.4_py-3.6) yongqiang@yongqiang:~$

    (pt-1.4_py-3.6) yongqiang@yongqiang:~$ python
    Python 3.6.10 |Anaconda, Inc.| (default, May 8 2020, 02:54:21)
    [GCC 7.3.0] on linux
    Type “help”, “copyright”, “credits” or “license” for more information.

    import torch
    import torchvision
    import torchaudio

    torch.version
    ‘1.4.0’

    torchvision.version
    ‘0.5.0’

    torchaudio.version
    ‘0.4.0a0+719bcc7’

    exit()
    (pt-1.4_py-3.6) yongqiang@yongqiang:~$

  • bash requirement.sh

    (pt-1.4_py-3.6) yongqiang@yongqiang:~/pytorch_work/end2end-asr-pytorch$ bash requirement.sh
    ……
    (pt-1.4_py-3.6) yongqiang@yongqiang:~/pytorch_work/end2end-asr-pytorch$

2.

References

Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences.
Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer.
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese.

发表评论

表情:
评论列表 (有 0 条评论,31人围观)

还没有评论,来说两句吧...

相关阅读

    相关 mysqlpid ended

    mysql 分组排序,pid为 n 的行 跟在 id 为 n 的行后面 题主的这个排序需求,用SQL来解决,其难度的确比较大,不过经过特殊的排序安排还是可以解决的。请参考