
CS425FZ Audio & Speech Processing

Posted: 2024-12-14 20:32:52
Tags: Plot, plot, Processing, window, should, waveform, CS425FZ, Speech, Audio

CS425FZ (Audio & Speech Processing) Assignment 1 (value 20%)

Release date: Tuesday 26th November 2024
Due date: Sunday 15th December 2024 at 23:59

This is an open-book, graded assignment. Please cite all references as comments in your submissions. You cannot directly reuse a solution from online sources or AI. You must not engage with another student, in person or electronically (via phone, social media, etc.), to secure assistance with this Assignment. If you do so (even for only one of the questions), you will receive an automatic failure (0%), and it will also be reported to the Executive Vice-Dean of MIEC and/or the Maynooth University Plagiarism board. We will perform similarity checks on submitted assignments to check for collaborative efforts. The lecturer reserves the right to interview you about your submission in special cases. Note that the Turnitin tool provided in Moodle can detect AI-generated content.

The first assignment is to use programs in Java, Processing, Python, or Octave/MATLAB to prepare a narrative on digital waveforms and spectral analysis using the FFT and the Spectrogram to demonstrate your knowledge of how they work. Make sure that each plot clearly illustrates the shape of the waveform, i.e. if you have to zoom in to get this, do so. A thick coloured block is not acceptable. The results from your plotting should be placed into a PowerPoint presentation, and along with each plot, a sound file should be inserted into the page (it should be imported as mp3 to save space). The slide following each plot should show the programming script used to generate the wave and its graph. The graphs should have titles, labelled axes and a caption in the document (e.g. Figure 1, Figure 2).

Waveforms

  1. Generate and plot one example of the waveform of a sinusoid at a frequency, amplitude, and phase of your choice. Show the waveform from time t=0. Select the frequency of the sine wave from the set of musical notes: https://homes.luddy.indiana.edu/donbyrd/Teach/MusicalPitchesTable.htm
  2. Generate and plot an example of a waveform composed of sinusoids at harmonically related frequencies to create either a sawtooth wave, a square wave or a triangle wave.
  3. Read in a wav file of an “effect”/natural sound and plot only 20 seconds of it.
  4. Read in a wav file of a Speech utterance (it could be from the web or recorded by yourself) and plot it (approx. 2-5 seconds). Put the text of the utterance in the title of the plot.
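As an illustration of the first two items only, the following is a minimal sketch and not a model answer. It assumes Python with NumPy, a sampling rate of 44100 Hz, and the note A4 (440 Hz); the function names `sinusoid` and `square_wave` are hypothetical. It builds a sinusoid starting at t=0 and approximates a square wave by summing odd harmonics with amplitudes proportional to 1/k.

```python
import numpy as np

def sinusoid(freq_hz, amplitude, phase_rad, duration_s, fs=44100):
    """Return (t, x) with x(t) = A * sin(2*pi*f*t + phase), starting at t = 0."""
    t = np.arange(int(round(duration_s * fs))) / fs
    return t, amplitude * np.sin(2 * np.pi * freq_hz * t + phase_rad)

def square_wave(f0_hz, n_harmonics, duration_s, fs=44100):
    """Approximate a square wave by summing odd harmonics weighted by 1/k."""
    t = np.arange(int(round(duration_s * fs))) / fs
    x = np.zeros_like(t)
    for i in range(n_harmonics):
        k = 2 * i + 1  # odd harmonic numbers: 1, 3, 5, ...
        x += np.sin(2 * np.pi * k * f0_hz * t) / k
    return t, (4 / np.pi) * x  # 4/pi scales the plateau towards +/-1

# Assumed example values: A4 = 440 Hz, 10 ms shown (about 4.4 periods).
t, a4 = sinusoid(440.0, 0.8, 0.0, 0.01)
# 25 odd harmonics: the highest is 49 * 440 Hz = 21560 Hz, below fs/2.
_, sq = square_wave(440.0, 25, 0.01)
print(len(t), float(np.max(np.abs(a4))))
```

Keeping the plotted duration to a few periods is what makes the shape of the waveform visible rather than a solid block; the number of harmonics is capped so that the highest one stays below the Nyquist frequency.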

Fourier transform

  1. Plot the magnitude of the Fourier transform (FFT) of a signal composed of more than one sinusoid of different frequencies and amplitudes using a rectangular window. Use an FFT length of N=256 and then N=2048.
  2. Plot the magnitude of the Fourier transform (FFT) of the same signal composed of more than one sinusoid of different frequencies and amplitudes using a Hanning window. Use an FFT length of N=256 and then N=2048.
  3. Record at least 1 second of you saying any vowel sound using Audacity or equivalent software. Use the editor to retain only the steady portion of the vowel waveform. Plot the magnitude of the Fourier transform of this, picking a suitable value for N (e.g. 256, 512, 1024, 2048) so that it is easy to identify at least two formant peaks from the spectrum.
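The comparison between the rectangular and Hanning windows can be sketched as follows. This is a hedged illustration, assuming Python with NumPy, a sampling rate of 8000 Hz, and a two-component test signal (440 Hz and 1000 Hz) chosen only for the example; the function name `magnitude_spectrum` is hypothetical. `np.hanning` is used for the Hanning window, and a rectangular window simply means the frame is left unweighted.

```python
import numpy as np

def magnitude_spectrum(x, n_fft, window="rect", fs=8000):
    """Magnitude of the N-point FFT of the first n_fft samples of x."""
    frame = np.asarray(x[:n_fft], dtype=float)
    if window == "hann":
        frame = frame * np.hanning(n_fft)   # taper the frame edges
    elif window != "rect":
        raise ValueError("window must be 'rect' or 'hann'")
    mag = np.abs(np.fft.rfft(frame, n=n_fft))      # bins from 0 to fs/2
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)     # bin centre frequencies in Hz
    return freqs, mag

# Assumed test signal: two sinusoids with different amplitudes.
fs = 8000
t = np.arange(2048) / fs
x = 1.0 * np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 1000 * t)

for n in (256, 2048):
    for w in ("rect", "hann"):
        freqs, mag = magnitude_spectrum(x, n, w, fs)
        peak_hz = freqs[np.argmax(mag)]
        print(f"N={n:4d} window={w}: bin spacing {fs / n:.2f} Hz, "
              f"largest peak near {peak_hz:.1f} Hz")
```

The bin spacing is fs/N, so the longer FFT places peaks on a finer frequency grid. The Hanning window widens each peak slightly but lowers the leakage away from the peaks, which is what the two sets of plots are expected to show.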

Spectrogram

  1. Plot the spectrogram of the speech waveform you used earlier for a short window N=256 and a long window N=1024. Identify the voiced and unvoiced speech in the plot.
  2. Plot the spectrogram of a sound effect that has distinctive frequency components, e.g. a bird sound, a chainsaw, a car starting, a clock strike. Pick an appropriate window length for the frequency components to be clearly displayed. Make sure to mention the window length in the title of the plot.
  3. Plot the spectrogram of a short drum loop of your choice with N=256 and N=2048 to show that the shorter window means a better time resolution, and thus, the points in time of the drum hits are easier to discern. Point this out in the figure's caption.
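The trade-off between time and frequency resolution in the last item can be sketched with a short-time FFT. This is a minimal sketch under stated assumptions: Python with NumPy, a sampling rate of 8000 Hz, 50% overlap, a Hanning window, and a synthetic signal of four decaying bursts standing in for a real drum loop. The function name `spectrogram` is hypothetical and is not a library call.

```python
import numpy as np

def spectrogram(x, fs, n_window, overlap=0.5):
    """Hanning-windowed short-time FFT; returns (times, freqs, S_db)."""
    hop = max(1, int(n_window * (1 - overlap)))
    win = np.hanning(n_window)
    starts = np.arange(0, len(x) - n_window + 1, hop)
    frames = np.stack([x[s:s + n_window] * win for s in starts])
    mag = np.abs(np.fft.rfft(frames, axis=1))
    s_db = 20 * np.log10(mag + 1e-12)          # small offset avoids log(0)
    times = (starts + n_window / 2) / fs        # frame centres in seconds
    freqs = np.fft.rfftfreq(n_window, d=1.0 / fs)
    return times, freqs, s_db.T                 # rows: frequency, columns: time

# Assumed stand-in for a drum loop: four short decaying 200 Hz bursts in 1 s.
fs = 8000
x = np.zeros(fs)
burst_t = np.arange(int(round(0.03 * fs))) / fs
burst = np.exp(-burst_t / 0.005) * np.sin(2 * np.pi * 200 * burst_t)
for onset_s in (0.10, 0.35, 0.60, 0.85):
    i = int(round(onset_s * fs))
    x[i:i + len(burst)] += burst

for n in (256, 2048):
    times, freqs, s_db = spectrogram(x, fs, n)
    print(f"N={n}: {s_db.shape[1]} frames, frame length {1000 * n / fs:.0f} ms, "
          f"bin spacing {fs / n:.2f} Hz")
```

With N=256 each frame spans 32 ms, so the silent gaps between hits remain visible as separate columns; with N=2048 each frame spans 256 ms and every frame contains a hit, so the onsets smear together. The returned array can be displayed with a plotting library, for example matplotlib's `pcolormesh`.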

 

From: https://www.cnblogs.com/CSE2425/p/18607063
