warp-transducer

A fast parallel implementation of RNN Transducer (Graves 2013 joint network), on both CPU and GPU.

GPU implementation is now available for Graves2012 add network.

GPU Performance

Benchmarked on a GeForce GTX 1080 Ti GPU.

T=150, L=40, A=28	warp-transducer
N=1	8.51 ms
N=16	11.43 ms
N=32	12.65 ms
N=64	14.75 ms
N=128	19.48 ms

T=150, L=20, A=5000	warp-transducer
N=1	4.79 ms
N=16	24.44 ms
N=32	41.38 ms
N=64	80.44 ms
N=128	51.46 ms

Interface

The interface is in include/rnnt.h. It supports CPU or GPU execution, and you can specify OpenMP parallelism if running on the CPU, or the CUDA stream if running on the GPU. We took care to ensure that the library does not preform memory allocation internally, in oder to avoid synchronizations and overheads caused by memory allocation. Please be carefull if you use the RNNTLoss CPU version, log_softmax should be manually called before the loss function. (For pytorch binding, this is optionally handled by tensor device.)

Compilation

warp-transducer has been tested on Ubuntu 16.04 and CentOS 7. Windows is not supported at this time.

First get the code:

git clone https://github.com/HawkAaron/warp-transducer
cd warp-transducer

create a build directory:

mkdir build
cd build

if you have a non standard CUDA install, add -DCUDA_TOOLKIT_ROOT_DIR=/path/to/cuda option to cmake so that CMake detects CUDA.

Run cmake and build:

cmake -DCUDA_TOOLKIT_ROOT_DIR=$CUDA_HOME ..
make

if it logs

-- cuda found TRUE
-- Building shared library with no GPU support

please run rm CMakeCache.txt and cmake again.

The C library should now be built along with test executables. If CUDA was detected, then test_gpu will be built; test_cpu will always be built.

Test

To run the tests, make sure the CUDA libraries are in LD_LIBRARY_PATH (DYLD_LIBRARY_PATH for OSX).

Contributing

We welcome improvements from the community, please feel free to submit pull requests.

Reference

标签：Building,cmake,..,warp,transducer,ms,GPU,CUDA
From： https://www.cnblogs.com/wcxia1985/p/17835520.html

使用Cmake创建一个head only的库（未完待续）
IntheCMakescriptyouprovided,thesecondparameteroftheadd_library()functionisanemptystring"".ThisisacommonpatternusedinCMaketocreatean"interface-only"libraryoraheader-onlylibrary.以下是示例add_library(conve......
cmake nginx 本地开发调试
主要是学习下如何基于cmake进行nginx的本地调试，也方便学习如何开发nginx模块参考配置CMakeLists.txtcmake_minimum_required(VERSION3.10)project(ngx_hello_module) #PathtotheNginxsourcecodeset(NGINX_SOURCE_PATH${CMAKE_SOURCE_DIR}/ngi......
【BUG解决】服务器没报警但是应用接口崩了....
最近遇到一个突发问题：服务器没报警但是应用接口崩了…为其他业务系统提供一个接口，平时好好的，突然就嚷嚷反馈说访问不了了，吓得我赶紧跳起来！正常情况下在系统崩溃前，我会收到很多系统报警，但是这次它悄无声息的出问题，还是挺恐怖的然后我立马看了下服务器的情况，服务器没有报警，也可以pin......
visualstudio+cmake+环境配置，及使用过程中的一些小坑点
一、概述想要找一个免费好用的c++开发工具。期间尝试了eclipse+CDT、VSCode+Mingw64+CMake都觉得不是很好用。最后选择了社区版的VisualStudio+CMake。之所以选择这个，是因为相对于eclipse和VSCode这个更好用。就是使用过程中会遇到一些小问题。二、配置1.下载Vis......
进一步优化！视频优化时长限制从1个小时提升到了1个半小时，解锁更多优质学习视频啦！...
昨天把YouTube中文配音升级到了1.0.0版本，这次没有继续采用0.x的版本号。主要是由于这次的更新，意味着主体能力的稳定。经过几个月的优化，现在对于视频的优化效率已经很高了，所以在这次的版本更新中，我们将视频配音优化的时长从1小时增加到了1个半小时。这意味着，更多优质的长视频，都可以......
cmake编译介绍--cmakelist.txt
1.cmake编译简介单个文件编译C/C++时:gccmain.c/g++main.cpp 多代码文件时：MakeFile,解决多文件编译难问题，运行make命令编译自动完成 cmake编译引入：根据一定的规则自动生成MakeFile的，也是有语法（cmake还是依赖make编译）。自动管理makefile文件，写起来也更方便、没有makefile......
parser/../../include/contTimeMC.hh:18:10: fatal error: gsl/gsl_matrix.h: No such
001、make编译遇到如下问题：parser/../../include/contTimeMC.hh:18:10:fatalerror:gsl/gsl_matrix.h:Nosuchfileordirectory 002、查找该文件(base)[root@pc1Augustus-3.5.0]#find/-name"gsl_matrix.h"##系统中确实不存在该文件(base)......
Java中关于try...catch的return规则
本部分针对有return要求的异常捕获和处理，具体的，try...catch语句存在于方法体中。方法体中的try...catch的return总共有四种可能的地方：try,catch,finally,方法体末尾（try…catch外）。共存规则finally中的return和方法return不能同时存在。（显而易见的第一法则！）try中的return......
../include/randseqaccess.hh:21:29: fatal error: mysql++/mysql++.h: No such file
001、make编译报错：../include/randseqaccess.hh:21:29:fatalerror:mysql++/mysql++.h:Nosuchfileordirectory 002、解决方法：。 ......
../include/types.hh:16:43: fatal error: boost/archive/text_oarchive.hpp: No such
001、make编译报错如下：../include/types.hh:16:43:fatalerror:boost/archive/text_oarchive.hpp:Nosuchfileordirectory 002、 ......

安装warp-transducer时cmake.. 出现 Building shared library with no GPU support的解决办法