User:Angeliki/Grad project speech analysis: Difference between revisions

Revision as of 14:04, 11 November 2018

Re- humanizing voice samples

Voice samples for training speech analysis software (LDC). Tracing the samples	Using speech analysis software to verify voice samples
what data: ordered samples or real samples (broadcast conversations, broadcast news, field recordings[air traffic, walking/noise background, ], meeting speech, microphone conversation, microphone speech, telephone conversations, telephone speech, transcribed speech, video)	examples of verification: diagnostic tool(for disease, depression), personal assistants (humanize the software voice), refugees seeking asylum/verification of claims of origin/Germany
from where: universities (of linguistics) around the world, research projects or satellites, radio	Example
extracts of descriptions of the samples: "Transcripts have been made of all recordings in this publication, manually time aligned to the phrasal level, annotated to identify boundaries between news stories, speaker turn boundaries and gender information about the speakers.", "The audio files are 8 KHz, 16-bit linear sampled data, representing continuous monitoring, without squelch or silence elimination, of a single FAA frequency for one to two hours.", "The Air Traffic Control Corpus (ATC0) is comprised of recorded speech for use in supporting research and development activities in the area of robust speech recognition in domains similar to air traffic control (several speakers, noisy channels, relatively small vocabulary, constrained languaged, etc.) The audio data is composed of voice communication traffic between various controllers and pilots."	Example
with permission from the users or not in the case of real samples	matter of privacy, de-humanizing automated processes regarding control of the body

@@ Line 1: / Line 1: @@
 == Re- humanizing voice samples ==
-{| class="wikitable"
+{|
 |-
-! Voice samples for training speech analysis software ([https://catalog.ldc.upenn.edu/search LDC])<br />
+! Voice samples for training speech analysis software ([https://catalog.ldc.upenn.edu/search LDC]). Tracing the samples !! Using speech analysis software to verify voice samples  <br />
-tracing the samples !! Using speech analysis software to verify voice samples (Germany/refugees entering)
 |-
-| what data: ordered samples or real samples (broadcast conversations, broadcast news, field recordings[[https://catalog.ldc.upenn.edu/LDC94S14A air traffic], [https://catalog.ldc.upenn.edu/LDC2015S08 walking/noise background], ], meeting speech, microphone conversation, microphone speech, telephone conversations, telephone speech, transcribed speech, video) || Example
+| what data: ordered samples or real samples (broadcast conversations, broadcast news, field recordings[[https://catalog.ldc.upenn.edu/LDC94S14A air traffic], [https://catalog.ldc.upenn.edu/LDC2015S08 walking/noise background], ], meeting speech, microphone conversation, microphone speech, telephone conversations, telephone speech, transcribed speech, video) || examples of verification: [https://www.dw.com/en/voice-analysis-an-objective-diagnostic-tool-based-on-flawed-algorithms/a-17187057 diagnostic tool(for disease, depression), personal assistants (humanize the software voice)], [https://gizmodo.com/experts-worry-as-germany-tests-voice-recognition-softwa-1793424680 refugees seeking asylum/verification of claims of origin/Germany] <br /><br />
 |-
-| from where: universities (of linguistics) around the world, research projects or satellites, radio|| Example
+| from where: universities (of linguistics) around the world, research projects or satellites, radio|| Example<br /><br />
 |-
-| extracts of descriptions of the samples: "Transcripts have been made of all recordings in this publication, manually time aligned to the phrasal level, annotated to identify boundaries between news stories, speaker turn boundaries and gender information about the speakers.", "The audio files are 8 KHz, 16-bit linear sampled data, representing continuous monitoring, without squelch or silence elimination, of a single FAA frequency for one to two hours.", "The Air Traffic Control Corpus (ATC0) is comprised of recorded speech for use in supporting research and development activities in the area of robust speech recognition in domains similar to air traffic control (several speakers, noisy channels, relatively small vocabulary, constrained languaged, etc.) The audio data is composed of voice communication traffic between various controllers and pilots."  || Example
+| extracts of descriptions of the samples: "Transcripts have been made of all recordings in this publication, manually time aligned to the phrasal level, annotated to identify boundaries between news stories, speaker turn boundaries and gender information about the speakers.", "The audio files are 8 KHz, 16-bit linear sampled data, representing continuous monitoring, without squelch or silence elimination, of a single FAA frequency for one to two hours.", "The Air Traffic Control Corpus (ATC0) is comprised of recorded speech for use in supporting research and development activities in the area of robust speech recognition in domains similar to air traffic control (several speakers, noisy channels, relatively small vocabulary, constrained languaged, etc.) The audio data is composed of voice communication traffic between various controllers and pilots."  || Example<br /><br />
 |-
-| with permission from the users or not in the case of real samples || Example
+| with permission from the users or not in the case of real samples || matter of privacy, de-humanizing automated processes regarding control of the body
 |-
 |}