Function guide to the MATLAB Voicebox toolbox

Date: 2022-11-25 11:37:59


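Voicebox does not ship with MATLAB. You have to download the archive yourself, unzip it into your toolbox folder, and add that folder to the MATLAB path before the functions can be used.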
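A minimal sketch of the path setup, assuming the archive was unzipped into a folder named voicebox under MATLAB's toolbox directory. That location and the variable name vbdir are illustrative assumptions, so point vbdir at wherever you actually extracted the files.

```matlab
% Hypothetical install location: adjust to where you unzipped Voicebox.
vbdir = fullfile(matlabroot, 'toolbox', 'voicebox');

if exist(vbdir, 'dir')
    addpath(vbdir);              % make the functions visible in this session
    disp(which('readwav'));      % shows the full path if readwav.m is now found
else
    warning('Voicebox folder not found: %s', vbdir);
end
```

addpath only affects the current session. Calling savepath afterwards keeps the folder on the path for later sessions, provided you have write access to the path file.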

Voicebox: Speech Processing Toolbox for MATLAB
  Some files have been prefixed "v_" to avoid name conflicts
 
  Audio File Input/Output

    readwav       - Read a WAV file
    writewav      - Write a WAV file
    readhtk       - Read HTK waveform files
    writehtk      - Write HTK waveform files
    readsfs       - Read SFS files
    readsph       - Read SPHERE/TIMIT waveform files
    readaif       - Read AIFF (Audio Interchange File Format) files
    readcnx       - Read BT Connex database files
    readau        - Read AU files (from SUN)
    readflac      - Read FLAC files
 

  Frequency Scales

    frq2bark      - Convert Hz to the Bark frequency scale
    frq2cent      - Convert Hertz to cents scale
    frq2erb       - Convert Hertz to ERB rate scale
    frq2mel       - Convert Hertz to mel scale
    frq2midi      - Convert Hertz to MIDI scale of semitones
    bark2frq      - Convert the Bark frequency scale to Hz
    cent2frq      - Convert cents scale to Hertz
    erb2frq       - Convert ERB rate scale to Hertz
    mel2frq       - Convert mel scale to Hertz
    midi2frq      - Convert MIDI scale of semitones to Hertz
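
The Hz-to-mel pair in the list above can be illustrated with the commonly used logarithmic mel formula, scaled so that 1000 Hz maps to 1000 mel. This is a base-MATLAB sketch under that assumption, not a copy of frq2mel or mel2frq; check their help text for the exact constants they use.

```matlab
% Assumed formula: mel = k*log(1 + f/700), with k chosen so 1000 Hz -> 1000 mel.
k   = 1000 / log(1 + 1000/700);   % about 1127.01
f   = [100 440 1000 4000];        % frequencies in Hz
mel = k * log(1 + f/700);         % Hz -> mel
f2  = 700 * (exp(mel/k) - 1);     % mel -> Hz (inverse mapping)

fprintf('%8.1f Hz -> %8.2f mel\n', [f; mel]);
fprintf('max round-trip error: %g Hz\n', max(abs(f2 - f)));
```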
 
 

  Fourier/DCT/Hartley Transforms

    rfft          - FFT of real data
    irfft         - Inverse of FFT of real data
    rsfft         - FFT of real symmetric data
    rdct          - DCT of real data
    irdct         - Inverse of DCT of real data
    rhartley      - Hartley transform of real data
    zoomfft       - Calculate the FFT over a portion of the spectrum with any resolution
    sphrharm      - Calculate forward and inverse spherical harmonic transformations
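
The reason a transform specialised for real data is worthwhile: the FFT of a real signal is conjugate-symmetric, so roughly half of the bins carry all the information. The sketch below shows this with the built-in fft and ifft for an even-length signal. It is an illustration of the idea only; the exact output layout of rfft and irfft should be taken from their own help text.

```matlab
% For real x, fft(x) is conjugate-symmetric, so bins 0..n/2 are sufficient.
n  = 8;
x  = (1:n)';                           % any real column vector (even length here)
X  = fft(x);
Xh = X(1:fix(n/2)+1);                  % keep bins 0 .. n/2 (5 values for n = 8)

% Rebuild the full spectrum from the half and invert it.
Xfull = [Xh; conj(Xh(end-1:-1:2))];    % mirror bins n/2-1 .. 1 (valid for even n)
xr    = real(ifft(Xfull));
disp(max(abs(xr - x)));                % reconstruction error, close to zero
```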

 
  Probability Distributions

    berk2prob     - Convert Berksons to probability
    gaussmix      - Fit a Gaussian mixture model to data values
    gaussmixd     - Calculate marginal and conditional density distributions and perform inference
    gaussmixk     - Estimate Kullback-Leibler divergence between two GMMs
    gaussmixg     - Calculate global mean, covariance and mode of a Gaussian mixture
    gaussmixm     - Estimate mean and variance of GMM vector magnitude
    gaussmixp     - Calculate and plot full and marginal probability density from a GMM
    gaussmixt     - Multiply two GMMs together
    gausprod      - Calculate the product of multiple Gaussians
    gmmlpdf       - OBSOLETE - use gaussmixp instead
    histndim      - N-dimensional histogram (+ plot 2-D histogram)
    lognmpdf      - Probability density function of a lognormal distribution
    maxgauss      - Calculate the mean and variance of max(x) where x is a Gaussian vector
    normcdflog    - Calculate the log of the Normal cdf without underflow
    prob2berk     - Convert probability to Berksons
    randvec       - Generate random vectors
    randiscr      - Generate discrete random values with prescribed probabilities
    rnsubset      - Select a random subset
    randfilt      - Generate filtered random noise without transients
    stdspectrum   - Generate standard audio and speech spectra
    usasi         - Generate USASI noise (obsolete: use stdspectrum instead)
    v_chimv       - Approximate mean and variance of non-central chi distribution
    vonmisespdf   - Calculate the pdf of the von Mises (circular normal) distribution
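
As an illustration of the last entry, the von Mises density on the circle is p(x) = exp(kappa*cos(x - mu)) / (2*pi*I0(kappa)), where I0 is the modified Bessel function of order zero. The sketch below evaluates that textbook formula with the built-in besseli and checks that it integrates to one. The variable names are illustrative, and the argument order of vonmisespdf itself is not shown in this post, so it is not called here.

```matlab
% Textbook von Mises density evaluated on one full turn of the circle.
mu    = pi/4;                          % mean direction (radians)
kappa = 2;                             % concentration (larger = narrower peak)
x     = linspace(-pi, pi, 2001);
p     = exp(kappa * cos(x - mu)) / (2*pi*besseli(0, kappa));

area   = trapz(x, p);                  % should be very close to 1
[~, i] = max(p);                       % the peak sits at the mean direction
fprintf('integral = %.6f, peak at %.4f rad\n', area, x(i));
```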

 
  Vector Distances

    disteusq      - Calculate Euclidean/Mahalanobis distances between two sets of vectors
    distchar      - COSH spectral distance between AR coefficient sets
    distitar      - Itakura spectral distance between AR coefficient sets
    distisar      - Itakura-Saito spectral distance between AR coefficient sets
    distchpf      - COSH spectral distance between power spectra
    distitpf      - Itakura spectral distance between power spectra
    distispf      - Itakura-Saito spectral distance between power spectra
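
To give a feel for these spectral distances, here is a sketch of the Itakura-Saito distance between two power spectra using its usual definition, the mean over frequency of r - log(r) - 1 with r = p1./p2. The helper name isdist is hypothetical, and the normalisation and options used by distispf are not stated in this post, so treat the numbers as illustrative only.

```matlab
% Itakura-Saito distance between power spectra (usual textbook definition).
isdist = @(p1, p2) mean(p1./p2 - log(p1./p2) - 1);

p1 = [1 2 4];                          % toy power spectra (linear power, all > 0)
p2 = [1 1 1];

d12 = isdist(p1, p2);
d21 = isdist(p2, p1);
d11 = isdist(p1, p1);
fprintf('d(p1,p2) = %.4f, d(p2,p1) = %.4f, d(p1,p1) = %.4f\n', d12, d21, d11);
```

The distance is zero only for identical spectra and is not symmetric, which is why the order of the two arguments matters.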
 

  Speech Analysis

    activlev      - Calculate the active level of speech (ITU-T P.56)
    activlevg     - Calculate the active level of speech robustly to added noise
    dypsa         - Estimate glottal closure instants from a speech waveform
    enframe       - Divide a speech signal into frames for frame-based processing
    correlogram   - Calculate a 3-D correlogram
    ewgrpdel      - Energy-weighted group delay waveform
    fram2wav      - Interpolate frame-based values to a waveform
    filtbankm     - Transformation matrix for a linear/mel/erb/bark-spaced filterbank from DFT output
    fxpefac       - PEFAC pitch tracker
    fxrapt        - RAPT pitch tracker
    gammabank     - Calculate a bank of IIR gammatone filters
    importsii     - Calculate the SII importance function (ANSI S3.5-1997)
    modspect      - Calculate the modulation spectrogram
    mos2pesq      - Convert MOS values to equivalent PESQ scores
    overlapadd    - Reconstitute an output waveform after frame-based processing
    pesq2mos      - Convert PESQ scores to equivalent MOS values
    phon2sone     - Convert signal levels from phons to sones
    psycdigit     - Experimental estimation of monotonic/unimodal psychometric function using TIDIGITS
    psycest       - Experimental estimation of monotonic psychometric function
    psycestu      - Experimental estimation of unimodal psychometric function
    psychofunc    - Psychometric functions
    v_sigma       - Identify glottal closure and opening instants from Lx or EGG waveform
    snrseg        - Segmental SNR and Global SNR calculation
    sone2phon     - Convert signal levels from sones to phons
    soundspeed    - Returns the speed of sound in air as a function of temperature
    spgrambw      - Spectrogram with many options
    stoi2prob     - Convert STOI intelligibility measure to probability of correct recognition
    txalign       - Align two sets of time markers
    vadsohn       - Voice activity detector
    v_ppmvu       - Calculate the PPM, VU or EBU levels of a signal
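
Several entries above (enframe, overlapadd, fram2wav) revolve around frame-based processing: cut the signal into overlapping frames, work on each frame, then stitch the result back together. The sketch below shows that round trip in base MATLAB with a rectangular window. The variable names are illustrative and this is not the code of enframe or overlapadd, whose windowing options are richer.

```matlab
% Split a signal into overlapping frames, one frame per row.
x   = (1:20)';                         % toy signal
len = 8;                               % frame length in samples
inc = 4;                               % hop between frame starts
nf  = fix((numel(x) - len + inc) / inc);             % number of complete frames
idx = repmat((0:nf-1)'*inc, 1, len) + repmat(1:len, nf, 1);
frames = x(idx);                       % nf-by-len matrix of samples

% Overlap-add the (unmodified) frames and divide by the overlap count.
y = zeros(size(x));
w = zeros(size(x));
for i = 1:nf
    r    = idx(i,:)';
    y(r) = y(r) + frames(i,:)';
    w(r) = w(r) + 1;
end
y = y ./ w;                            % equals x because nothing was changed
```

With a tapered window instead of the rectangular one, the division by the overlap count is normally replaced by choosing a window and hop whose shifted copies sum to a constant.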
 

  LPC Analysis of Speech (LPC = linear predictive coding)

    ccwarpf       - Warp complex cepstrum coefficients
    lpcauto       - LPC analysis: autocorrelation method
    lpcbwexp      - Bandwidth expansion of LPC filter
    lpccovar      - LPC analysis: covariance method
    lpcconv       - Arbitrary conversion between LPC representations
    lpcifilt      - Inverse filter a speech signal
    lpcrand       - Create random stable filters
    lpcrr2am      - Matrix with all LPC filters up to order p
    lpcstable     - Check for stability and force stable filters
    lpc--2--      - Convert between alternative LPC representations

 
  Speech Synthesis

    sapisynth     - Text-to-speech synthesis of a string or matrix
    glotros       - Rosenberg model of glottal waveform
    glotlf        - Liljencrants-Fant model of glottal waveform
 

  Speech Enhancement

    estnoiseg     - Estimate the noise spectrum from noisy speech using MMSE method
    estnoisem     - Estimate the noise spectrum from noisy speech using minimum statistics
    specsub       - Speech enhancement using spectral subtraction
    ssubmmse      - Speech enhancement using MMSE estimate of spectral amplitude or log amplitude
    ssubmmsev     - Speech enhancement using MMSE estimate and VAD-based noise estimation
    specsubm      - (obsolete algorithm) Spectral subtraction
    spendred      - Speech Enhancement and Dereverberation (Doire's algorithm)

 
  Speech Coding

    lin2pcmu      - Convert linear PCM to mu-law PCM
    pcma2lin      - Convert A-law PCM to linear PCM
    pcmu2lin      - Convert mu-law PCM to linear PCM
    lin2pcma      - Convert linear PCM to A-law PCM
    kmeanlbg      - Vector quantisation: LBG algorithm
    kmeanhar      - Vector quantisation: K-harmonic means
    potsband      - Create telephone bandwidth filter
    v_kmeans      - Vector quantisation: k-means algorithm
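
The mu-law entries are easier to read with the companding curve in mind: small amplitudes are stretched and large ones compressed before quantisation. The sketch below applies the continuous mu-law curve with mu = 255 and then inverts it. It is an illustration of the curve only, not the bit-exact 8-bit coding performed by lin2pcmu and pcmu2lin, and the variable names are illustrative.

```matlab
% Continuous mu-law companding curve (not the bit-exact 8-bit codec).
mu = 255;
x  = linspace(-1, 1, 9);                               % linear samples in [-1, 1]
y  = sign(x) .* log(1 + mu*abs(x)) / log(1 + mu);      % compress
xr = sign(y) .* ((1 + mu).^abs(y) - 1) / mu;           % expand (inverse curve)

disp([x; y; xr]');                                     % columns: input, companded, recovered
```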

 
  Speech Recognition

    melbankm      - Mel filterbank transformation matrix
    melcepst      - Mel cepstrum frontend for recogniser
    cep2pow       - Convert mel cepstrum means & variances to power domain
    pow2cep       - Convert power domain means & variances to mel cepstrum
    ldatrace      - Constrained Linear Discriminant Analysis to maximize trace(W\B)
 

  Signal Processing

    ditherq       - Add dither and quantize a signal
    filterbank    - Apply a bank of IIR filters to a signal
    maxfilt       - Running maximum filter
    meansqtf      - Output power of a filter with white noise input
    momfilt       - Generate running moments
    schmitt       - Pass a signal through a Schmitt trigger
    sigalign      - Align a clean reference with a noisy signal
    teager        - Calculate the Teager energy waveform
    v_addnoise    - Add noise to a signal at a chosen SNR
    v_findpeaks   - Find peaks in a signal or spectrum
    v_resample    - Resamples a signal: identical to MATLAB resample but removes filter transients
    v_windinfo    - Calculate window properties and figures of merit
    v_windows     - Window function generation
    zerocros      - Find interpolated zero crossings. Waveform data that has been split
                    into frames with buffer can be used as the input. The result is the
                    set of x positions, found by linear interpolation, at which the
                    waveform crosses y=0. The slope at the crossing is selected with
                    zerocros(y,'p') for positive-going or zerocros(y,'n') for
                    negative-going crossings; the default, 'b', returns both.
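
The zerocros note above describes linear interpolation between the two samples that straddle y=0, with the crossings then split by slope. A base-MATLAB sketch of that idea follows. The variable names are illustrative and this is not the code of zerocros itself; samples are assumed to sit at x = 1, 2, 3, ...

```matlab
% Interpolated zero crossings of a sampled waveform.
y  = [-1 1 3 1 -1 -2 2]';              % toy waveform, samples at x = 1, 2, 3, ...
s  = y >= 0;                           % sign of each sample (zero counts as positive)
k  = find(s(2:end) ~= s(1:end-1));     % a crossing lies between samples k and k+1
t  = k + y(k) ./ (y(k) - y(k+1));      % linear interpolation of the crossing position
up = y(k+1) > y(k);                    % true where the waveform is rising

tp = t(up);                            % positive-going crossings (like the 'p' mode)
tn = t(~up);                           % negative-going crossings (like the 'n' mode)
disp(t');                              % all crossings (like the 'b' mode)
```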

 
  Information Theory

    huffman       - Generate Huffman code
    entropy       - Calculate entropy and conditional entropy
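
The two entries are closely related: the entropy of a source is the lower bound on the average length of a prefix code such as a Huffman code. The sketch below computes the entropy of a small distribution directly from the definition and compares it with the average length of a code whose lengths are 1, 2, 3 and 3 bits. It uses only base MATLAB and does not call huffman or entropy, whose argument formats are not described in this post.

```matlab
% Entropy in bits of a discrete distribution, straight from the definition.
p  = [0.5 0.25 0.125 0.125];           % symbol probabilities (sum to 1)
nz = p > 0;                            % skip zero-probability symbols (0*log 0 = 0)
H  = -sum(p(nz) .* log2(p(nz)));       % entropy in bits per symbol

len = [1 2 3 3];                       % code lengths of a prefix code for these symbols
L   = sum(p .* len);                   % average code length in bits per symbol
fprintf('H = %.4f bits, average code length = %.4f bits\n', H, L);
```

Because every probability here is a power of 1/2, the average code length meets the entropy bound exactly; for other distributions it is at most one bit above it.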
 

  Computer Vision

    imagehomog    - Apply a homography transformation to an image with bilinear interpolation
    polygonarea   - Calculate the area of a polygon
    polygonwind   - Test if points are inside or outside a polygon
    polygonxline  - Find where a line crosses a polygon
    qrabs         - Absolute value of a real quaternion
    qrdivide      - Divide two real quaternions (or invert one)
    qrdotdiv      - Elementwise division of two real quaternion arrays
    qrdotmult     - Elementwise multiplication of two real quaternion arrays
    qrmult        - Multiply two real quaternion arrays
    qrpermute     - Permute the indices of a quaternion array
    rectifyhomog  - Apply rectifying homographies to a set of cameras to make their optical axes parallel
    rot--2--      - Convert between different representations of rotations
    rotqrmean     - Find the average of several rotation quaternions
    rotqrvec      - Apply a quaternion rotation to an array of 3D vectors
    sphrharm      - Forward and inverse spherical harmonic transform using uniform, Gaussian
                    or arbitrary inclination (elevation) grids and a uniform azimuth grid.
    upolyhedron   - Calculate the vertex coordinates and other characteristics of a uniform polyhedron
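
As a small worked example for the polygon entries, the area of a simple polygon can be computed from its vertex list with the shoelace formula. The sketch below is base MATLAB and does not call polygonarea, whose input format and sign convention are not given in this post; the variable names are illustrative.

```matlab
% Shoelace formula for the signed area of a simple polygon.
xy = [0 0; 4 0; 4 3; 0 3];             % vertices of a 4-by-3 rectangle, counter-clockwise
x  = xy(:,1);
y  = xy(:,2);
xn = x([2:end 1]);                     % next vertex, wrapping around to the first
yn = y([2:end 1]);
a  = 0.5 * sum(x.*yn - xn.*y);         % positive for counter-clockwise vertex order
fprintf('area = %g\n', a);
```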

 
  Printing and Display Functions

    axisenlarge   - Selectively enlarge figure axis for clarity
    cblabel       - Add a label onto the colorbar
    figbolden     - Make a figure bold and adjust colours for printing clearly
    fig2emf       - Make a figure bold and save as a windows metafile
    frac2bin      - Convert numbers to fixed-point binary strings
    lambda2rgb    - convert wavelength to XYZ or RGB colour triplets
    sprintsi      - Print a value with an SI multiplier
    sprintcpx     - Print a complex number with real and imaginary parts
    texthvc       - write text on a plot with specified alignment and colour
    tilefigs      - Arrange all figures on the screen
    v_colormap    - Set and plot colormap information
    xticksi       - Label x-axis tick marks using SI multipliers
    yticksi       - Label y-axis tick marks using SI multipliers
    xyzticksi     - Helper function for xticksi and yticksi
 

  Voicebox Parameters and System Interface

    voicebox      - Global installation-dependent parameters
    unixwhich     - Search the WINDOWS system path for an executable program (like UNIX which)
    winenvar      - Obtain WINDOWS environment variables
 

  Utility Functions

    atan2sc       - Arctangent function that returns the sin and cos of the angle
    bitsprec      - Rounds values to a precision of n bits
    choosenk      - All choices of k elements out of 1:n without replacement
    choosrnk      - All choices of k elements out of 1:n with replacement
    dlyapsq       - Solve the discrete Lyapunov equation
    dualdiag      - Simultaneously diagonalise two Hermitian matrices
    finishat      - Estimate the finishing time of a long loop
    fopenmkd      - like FOPEN() but creates any missing directories/folders
    hostipinfo    - Get information about the computer name and internet connections
    hypergeom1f1  - Confluent Hypergeometric function or Kummer's M function
    logsum        - Calculates log(sum(exp(x))) without overflow/underflow
    minspane      - calculate the minimum (or shortest) spanning tree
    mintrace      - find a row permutation to minimize the trace of a matrix
    m2htmlpwd     - Create HTML documentation of matlab routines in the current directory
    nearnonz      - Replace each zero element with the nearest non-zero element
    permutes      - All n! permutations of 1:n
    quadpeak      - Find quadratically-interpolated peak in a 2D array
    rotation      - Generate rotation matrices
    skew3d        - Generate 3x3 skew symmetric matrices
    zerotrim      - Remove empty trailing rows and columns
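
The logsum entry mentions computing log(sum(exp(x))) without overflow or underflow. The standard trick is to subtract the maximum before exponentiating and add it back afterwards, as sketched below in base MATLAB. This shows the technique only and is not the code of logsum, which also accepts extra arguments not covered here.

```matlab
% log(sum(exp(x))) computed naively and with the max-subtraction trick.
x      = [-1000 -1001 -1002];          % log-domain values far below the double range
naive  = log(sum(exp(x)));             % -Inf: every exp(x) underflows to 0
m      = max(x);
stable = m + log(sum(exp(x - m)));     % about -999.59
fprintf('naive = %g, stable = %.4f\n', naive, stable);
```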
 
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%


voicebox is the name of the folder and also the name of a function.


 voicebox  Set global parameters for Voicebox functions: Y=voicebox(F,V)
 
   Inputs:  F   is a field name
            V   is a new value for the field
 
  Outputs:  Y   is set equal to the structure of parameters if the
                f and v inputs are both present or both absent. If only
                input f is specified, then y is set to the value of the
                corresponding field or null if it doesn't exist.
 
  You can override the defaults set here by setting the environment variable "voicebox"
  to the path of an m-file that contains lines like "% PP.dir_temp='F:\TEMP';"
 
  This routine contains default values for constants that are used by
  other functions in the voicebox toolbox. Values in the first section below,
  entitled "System-dependent directory paths" should be set as follows:
 
     PP.dir_temp     directory for storing temporary files
     PP.dir_data     default directory to prepend to speech data file names
                     when the "d" option is specified in READWAV etc.
     PP.shorten      location of SHORTEN executable. SHORTEN is a proprietary file compression
                     algorithm that is used for some SPHERE-format files. READSPH
                     will try to call an external decoder if it is asked to
                     read such a compressed file.
     PP.sfsbin       location of Speech Filing System binaries. If the "c" option
                     is given to READSFS, it will try to create a requested item
                     if it is not present in the SFS file. This parameter tells it
                     where to find the SFS executables.
     PP.sfssuffix    suffix for Speech Filing System binaries. READSFS uses this parameter
                     to create the name of an SFS executable (see PP.sfsbin above).
  Other values defined in this routine are the defaults for specific algorithm constants.
  If you want to change these, please refer to the individual routines for a fuller description.
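
The three calling patterns described in the help text above can be sketched as follows. The example is guarded so that it does nothing when voicebox.m is not on the path, and it uses MATLAB's tempdir as a stand-in value for dir_temp; that choice is an assumption made for illustration, not a recommended setting.

```matlab
% Query and set Voicebox parameters (runs only if voicebox.m is on the path).
if ~isempty(which('voicebox'))
    pp  = voicebox;                        % no inputs: the whole parameter structure
    tmp = voicebox('dir_temp');            % field name only: the value of that field
    pp2 = voicebox('dir_temp', tempdir);   % field and value: set it, structure returned
    disp(tmp);
else
    disp('voicebox.m is not on the MATLAB path');
end
```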



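The original English text can be displayed with help dirname, where dirname is the folder that holds the files. Because I put the .m files in a folder named voicebox, help voicebox prints the English listing and parameter description reproduced in this post.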

From: https://blog.51cto.com/datrilla/5886058
