User:Angeliki/Grad project speech analysis

From XPUB & Lens-Based wiki

Re- humanizing voice samples

Voice samples for training speech analysis software (LDC)

tracing the samples !! Using speech analysis software to verify voice samples (Germany/refugees entering)

what data: ordered samples or real samples (broadcast conversations, broadcast news, field recordings[air traffic, walking/noise background, ], meeting speech, microphone conversation, microphone speech, telephone conversations, telephone speech, transcribed speech, video) Example
from where: universities (of linguistics) around the world, research projects or satellites, radio Example
extracts of descriptions of the samples: "Transcripts have been made of all recordings in this publication, manually time aligned to the phrasal level, annotated to identify boundaries between news stories, speaker turn boundaries and gender information about the speakers.", "The audio files are 8 KHz, 16-bit linear sampled data, representing continuous monitoring, without squelch or silence elimination, of a single FAA frequency for one to two hours.", "The Air Traffic Control Corpus (ATC0) is comprised of recorded speech for use in supporting research and development activities in the area of robust speech recognition in domains similar to air traffic control (several speakers, noisy channels, relatively small vocabulary, constrained languaged, etc.) The audio data is composed of voice communication traffic between various controllers and pilots." Example
with permission from the users or not in the case of real samples Example