User:Angeliki/Grad project loop: Difference between revisions

From XPUB & Lens-Based wiki
(Created page with "what: I want to make visible the human presence in the loop of the communication systems that has been gradually formed the last years of the rising of social media and machin...")
 
No edit summary
 
(32 intermediate revisions by 2 users not shown)
Line 1: Line 1:
what:
<big style= "color:red;">[UPDATE: My project now is focusing on the exclusion of female voices from public sphere. My explorations regard their amplification, through their mediation] </big>
I want to make visible the human presence in the loop of the communication systems that has been gradually formed the last years of the rising of social media and machines. In our post-human era “we loose the basic distance which makes us human” in “the prospect of the direct link between our brain and the digital network” (The Economist, 2018). It seems that we are more than distant users using a communication tool. We are part of the circuit. Thinking back in time about the communication diagram of Shannon Claude1 the one way communication. The process described in this diagram has become more complex and the new data practices have intervened into the communication system making eternal loops (as shown in the diagram annotated by me). We are already part of it and we cant see it. We are part of the loop, we experience it everyday and we can intervene in it. As Zizek observes, it is very difficult for the common person to understand how algorithms work “but we can easily understand how we are controlled by the digital grid” (The Economist, 2018).
I want to highlight several qualities regarding this process, like materiality, time (delay) and embodiment, that are disappearing and altered into this massive and quick automated process. For example, it is commonly unknown that the most important part of making an intelligent ‘assistant’ is the contribution of huge data of human voices. I want to engage with these tools and at the same time to try practices that we can understand where are we in this loop and intervene into it.
Speech recognition and analysis tools are trained by a database of 'real' voice samples. These samples come from different sources and communication systems around the world [imitations of conversations, radio broadcasts, telephone conversations, field recordings, readings] sometimes with the permission of the people donating their voice. At the same time these tools (like automatic dialect analysis) are used from states [Germany] to verify the claims of origin of refugees. It is very often that they can get wrong because "Identifying the region of origin for anyone based on their speech is an extremely complex task" and depends on "a wide range of factors". Google also is using voice recordings from other google apps to train its speech-to-text tool.


== Human always in the communication loop ==
My title has derived from the term Human-in-the-loop (HITL) that “is a branch of artificial intelligence that leverages both human and machine intelligence to create machine learning models” (Anadiotis, 2018).
=== What do I want to make? ===
I want to make a series of experiments involving participants that will imitate activities happening parallelly to our communication systems. One possible outcome of this would be an online database with all these practices and material that will be produced. It could work as a toolkit for me and other people that are interested to get involved. Another possible outcome is a media object that will assist collective practices like that to happen.<br />
The diagram of Shannon Claude of 1948, shown below, presents how a general communication system works. The message is passed through a transmitter that encodes it and transfers it through a one way channel to the receiver. The latter decodes the message and the person from the other side listens to or reads it. This way of communication, that the technology of the previous century created, brought a new relationship with our bodies. One of the most intense experiences of it was the detachment of our voice from our physical bodies. Our voice being mediated through different kinds of media gets distorted, sounds unfamiliar and gets disconnected from us. <br />
<img src="https://monoskop.org/images/5/5e/Shannon_Claude_E_1948_General_communication_system_diagram.jpg" style="max-width:50%;">
Nowadays, the new data practices of big corporations like Google and Facebook are intervening into this communication system, listening to, transcribing, documenting our voices and messages. It is commonly unknown that the most important part of making and training an intelligent ‘assistant’ like Amazon's Alexa, that is based on speech recognition tool, is the contribution of huge data of human voices. The samples used by the corporations come from different sources and communication systems around the world. They may be imitations of conversations with microphone, radio broadcasts, telephone conversations and field recordings. At the same time these tools (like the automatic dialect analysis) are used from the state of Germany to verify the claims of origin of refugees. It is very often that they can get wrong because "Identifying the region of origin for anyone based on their speech is an extremely complex task" and depends on "a wide range of factors"(Welle (www.dw.com), 2017). Google also is using voice recordings of the users from other google apps to train its speech recognition tool.<br />
<img src="https://ichef.bbci.co.uk/news/660/cpsprodpb/147DB/production/_95213938_gettyimages-164047267.jpg" style="max-width:40%;"> Refugee interviewed<br />
So, based on this contemporary condition, I annotated the diagram with possible interventions of the corporations of today. Even though the outcome of data practices is a product/machine, a huge amount of human activities are related in order to train it, like transcribing, recording and speaking. The one way channel becomes a loop, where the user communicating is also ‘input’ for the machine that will provide her/his tool of communication. We are more than users using various technical mediums, we are part of the loop. <br />
As Zizek observes, it is very difficult for the common person to understand how algorithms work “but we can easily understand how we are controlled by the digital grid” (The Economist, 2018). I want to make visible the human presence in the loop by understanding it and share it in a physical space in real time with our physical presence.


How:
[[File:Diagram-communication-loop.JPG]]
I want to make visible this condition (of human always on the loop) through a collection of material related to trained data of speech recognition tools [voice samples, ways of collection]5 and practices that re-enact the loop with the inclusion of the embodied awareness. These practices may be related to the human processes involved in the loop, like the presence of human annotators [transcription], listening and identifying voices, decoding and encoding but also other elements like the delay in the circuit and the spatial qualities that distort the message. 
Some voice samples that I found from an organisation involved for the training of a speech recognition tools [pocketsphinx]:
And this is an example of a description accompanied these voice samples that shows the way that a voice sample was made:
...Its format consists of invited guests (most often politicians) spontaneously answering topical questions posed by one or two interviewers. The number of interviewees in a single program varies from one to three, but typically, one interviewer and two interviewees appear in the program. The material includes passages of interactive dialogue, but longer stretches of monologue-like speech comprise the majority of the collected data. Radioforum also has an interactive segment where listeners call the studio and ask their own questions.
This may be part of the practices that will imitate the processes.
These are exercises of sharing and following collectively instructions that we get connected with the inner functions of the body, like circulation, electromagnetism and vibrations, by reading, listening and moving.
During deep listening sessions in my course I became the medium [encoding/decoding]sharing information with my colleagues. You wear headphones and listen to a video and audio of Pauline’s instructions on deep listening, so you listen to the audio and copy what you see and repeat what you hear. You become the medium. When Clara did it it seemed as if it was coming from her head as we didn’t listen to the audio and she was continuously repeating. When you do this you don’t have time to interpret the text, and at the same time you are doing things with your body, you become detached from yourself and become another person. 
I felt I became another person, I didn't have time to think of my position in the exercise; I understood the detachment as when you talk through a medium and listen back to your voice and thus it becomes another person, an alien voice which is distorted by the medium. My body and voice transmitted but also distorted the voice of Pauline Oliveros. This mediation happens at that moment. I am fascinated by the embodiment in that technology, to become a mechanical device. I had not listened to these instructions before so I was alert and unable to interpret fast. It is a difficult task to transmit and to receive and this is where I understood myself as a medium. As with the telephone the medium needs time. There is a slight delay. This is becoming faster, but there is an inbetween space which pushes your voice somewhere else. This is telepresence. To try these small experiments everyday I think I will find a place between our bodies and the technology of mediation. 
I think to start grasp and control a reality that we are already in [human-in-the-loop] we should understand through our bodies and processes happening with our bodies. So what I mean for example I become the medium. So even the simple processes of reading and listening as we are receivers or transmitters.
But also transcriptions, speech recognition and involve more people.


A metaphor of the loop: asking personal questions and then everyone else transcribing and spreading the message
=== How do I plan to make it? ===
One is the transmitter another one the receiver and in between
I plan to research on data practices and databases of voices and transcriptions to find out what practices are related to it and what I can borrow from them. One example of database <ref>https://catalog.ldc.upenn.edu/search</ref> that I find interesting is one made by an organisation collecting voice samples with various ways and from various sources and medium. Some voice samples and transcriptions from that database:
{|
|-
| {{#Widget:Audio|mp3=https://catalog.ldc.upenn.edu/desc/addenda/LDC2017S17.flac}}<br />
(microphone conversation)
<pre style="white-space: pre-wrap;">
Interview 15
(A=Interviewer; B=Interviewee)
A: So we are recording.  Awesome.  So how long have you lived in Flint? (unclear)
B: 38 years.
A: Is that your whole life?  Wow you look really young.
B: Thank you!(...)
</pre>
|-
|{{#Widget:Audio|mp3=http://pzwiki.wdka.nl/mw-mediadesign/images/d/df/LDC2015S08_walking.mp3}}<br />
(telephone conversation/ giving directions on spot while walking)
|}<br />


What is my timetable:
And this is an example of a description accompanied these voice samples that shows the way that a voice sample was made/collected:
Until the end of January:
''...The number of interviewees in a single program varies from one to three, but typically, one interviewer and two interviewees appear in the program. The material includes passages of interactive dialogue, but longer stretches of monologue-like speech comprise the majority of the collected data...''. This material can be used for the actions I want to make. <br />
1. Trying these out in the public space including walking, disrupting the public rhythm and documenting observations, photos, videos, audio or text produced.
2. Do more experiments on reading, listening, repeating, walking with two or more people. Creating a metaphor with the body for the feedback structure of radio. The actions will be collective readings, deep listening exercises with a focus on the relation with the radio. Documenting observations and audio, video, photos, text produced
In the beginning of February:
3. Merging the two sides of the experiments. How the one knowledge affects/informs the other. What new knowledge can be gained.  
4. Involve more people in the process. Invite them to share their personal relation to these experiments [how they perceive the mediation of their voice and their parallel presence] and use this material in the process. Merge my process with facts related to the city and the people involved (where is that process important or not). People that have a difficulty on language and people that are afraid of surveillance and frequencies or they perceive them as an alien fact. People that learning this technical aspect of radio, but also do experiments like that, will be useful for their daily lives and struggles. People that I meet at my outside experiments.
March:
proceed with what all the previous experiments will bring.  


At the same time exercises like the deep listening sessions of Pauline Oliveros <ref>http://deeplistening.org/site/content/workshops</ref> can be a daily experimentation where I can find a place between our bodies and the technology of mediation. How a human transfers the message as becoming the medium? These exercises are about sharing and following collectively instructions that we get connected with the inner functions of the body, like circulation, electromagnetism and vibrations, by reading, listening and moving. My personal experience of it that connects this exercise with the mediation of the voice was that: I was listening with headphones to the instructions of Oliveros and I was repeating what she was saying. The other participants were following the instructions coming from my mouth. I felt that I became the medium that didn't have time to interpret or intervene with my own personality into the message as it was new for me. I mediated the voice of Oliveros like a telephone would do. <br />


why (transcription):
I want to relate the data practices with the collective exercises on a way that will activate the loop I described with the human presence. Some of the actions will involve human annotators [transcription], listening and identifying voices, decoding and encoding, donating voice samples.<br />
This era of post-humanity is about the “expanding role of science, machines and digital media in social control and regulation” (The Economist, 2018). We think that we may have a safe distance from our mediated communication and we perceive ourselves mostly as users but we don’t really understand ourselves as part of it. Parts of our personal elements, like voiceprints, are fed as data to the system. I think the shift of this perception towards our embodied presence in the communication loop can open up possibilities for the relation we have with it.
Our communication and our relation with these tools have been mostly formed by big corporations or the state, military, male, science. It seems to me that the communication platforms are estranged3 realities where the body [cultural, physical, political, gender] disappears4. They are technically complex and there is a mystery around them [like for speech recognition training data-machine learning, radio]
My purpose is to redefine those mediums by appropriate them in a way that follows up the "situation"/position (cultural, political) of a person. To redefine them more conceptually and as part of movements.
Actually language is a technology and has specific rules and has specific requ­irements. We use very specific parts of the brain and rules/codes[orality] that we communicate and it is a non human/distant technology that our body is involved and have adjusted in this. And we know to be involved in it because it is an old medium ad we have realized that with a lot of experience and communication.  Where is the materiality of these technologies and where our bodies are in it. I think that by approaching the distortion we have with all these different technologies I feel that we should be able to strengthen these relationships we have with these technologies and not only as users [the most dominant idea]. It is just a tool but actually we are part of the tool and without us the circulation doesnt work so why not to have another way/ its not about using a technology, a space, a body. It is something more. I want to engage these technologies that are not abstract terms.  We see them as alien spheres as other persons [like Alexa and Siri]


I want to do this project because we are gradually and rapidly donate our personal data in big organisations in the shake of the "public good" without thinking of it [ethics, politics on our bodies and social behaviour]. I also observe that all these new politics that enter our private and collective spheres estrange [2] us from our relations with the others and our surroundings/ dehumanize our lives. Especially I am interested in voice because our voice is a personal and unique element very related to our bodily conditions. For oral cultures the voice was a medium to spread knowledge, on a way that differs a lot from the writing cultures, "When auditory experiences are shared, histories too are shared, and not only from mouth to ear: they are perceived by and encoded in the body through the physicality of sound waves and passed on from one generation to another."[3] "should we be worried about the large-scale harvesting of our voiceprints?" "The companies behind this technology say that a voiceprint includes more than 100 unique physical and behavioural characteristics of each individual, such as length of the vocal tract, nasal passage, pitch, accent and so on. They claim it is as unique to an individual as a fingerprint, and that their systems even recognise people if they have a cold or sore throat."[4]. "Your voice is yours alone – as unique to you as your fingerprints, eyeballs and DNA."
The documentation of all these will resemble the databases I will find but with a different purpose than training machines.  




=== What is my timetable? ===
Until the end of January:
Conducting interviews with people related in collecting voice data or engaged with and re-appropriating communication technologies. I will participate in workshops or lectures related to these topics throughout the year, like CCC (Chaos Communication Congress). I will research on recordings/databases and ways of collecting voice samples for training. At the same time I will experiment with mediation in private and public spaces. I will also do more experiments on collective reading, listening, repeating, walking.
I will document observations and audio, video, photos, text produced.<br />
26/11: in my pyratechnic session with Alice I will prepare a loop process in which the students will try different actions related to the loop in my diagram.<br />
30/11: I will try the same in Leeszaal, where people from different countries and backgrounds visit. <br />
I will decide what the outcome will be.
<br />
February:
I will relate the dynamics of the previous experiments and I will proceed with involving more people in the process. Invite them to share their personal relation to these experiments (how they perceive the mediation of their voice and their presence in the loop) and use this material in the process. Merge my process with facts related to the people involved (where is that process important or not). <br />
March-April:
Considering the documentation and outcome of my processes I will re-think my next steps and select part of the practices that have more impact.<br />
May-June:
Wrap up


=== Why do I want to make it?===
I believe that all these procedures happening on a daily base with or without our knowledge. They have an affect on our physical bodies that we unconsciously skip. I want to create an experiential model of collective sharing of our voices based on the system I am researching on and described earlier. My purpose is to strengthen our awareness for our involvement in the loop and answer in some questions that bother me. I want to investigate these questions with the groups of people that will be involved in the process. Some of these questions is how our bodies are influenced by these systems, what is the control over them, what are the new relation with our voices and how can we appropriate these new technologies more consciously, what new material and tools these experiments produce?<br />
Our communication and our relation with these systems have been mainly formed by big corporations or the states and their character is mostly male technical or military. It seems to me that the communication platforms are estranged<ref>estranged= 1. To make hostile, unsympathetic, or indifferent; 2. To remove from an accustomed place or set of associations</ref> realities where the personal body [cultural, physical, political, gender] disappears</ref> realities difficult to understand. The context (social, cultural background, gender) that defines an individual is disappearing or manipulated into this massive automated process happening throughout our communication systems. Our data and bodies the same. I am interested especially in voice because it is a personal and unique element carrying many things from the personality. For oral cultures the voice was a medium to spread knowledge, on a way that differs a lot from the writing cultures, "When auditory experiences are shared, histories too are shared, and not only from mouth to ear: they are perceived by and encoded in the body through the physicality of sound waves and passed on from one generation to another"(Public Radio - documenta 14, 2017). <br />
Technology becomes an extension of this desire to reach the invisible and distant, something beyond the limitations of your own body. But when I talk about detachment I don’t mean it in a negative sense; there is first an alienation and frustration and distancing, but if we understand it with our body we understand this communication, so it is a way of understanding media through simple techniques as, for instance, just ‘repeating’ a youtube video or transcribing the voice of our interlocutor.


Who can help me and how:
=== Who can help me and how? ===
-People I interview like Reni Hofmüller that can help me on experimenting with the specific technology and imagining other aspects of radio.  
People I interview like Reni Hofmüller can help me on experimenting with the specific technology of mediation and imagining other aspects of them. Also, Raadio Caargo can help me imagining potential futures of the mediums by engaging with feminist methodologies and practices. Joana, a former student can help me with prototyping, references and discussion on embodied and distant voice. My tutors Amy and Clara with deep listening exercises. My XPUB colleagues with participating in my first experiments.
-Raadio Caargo can help me imagining potential futures [feminist futurotopias as they call it] by engaging with different related methodologies and practices. As they say: "reclaim our relationships with techniques and technologies, to subvert these techniques, to blur genders and blow
-Joana with prototyping [making embroidered antennas], references and discussion on embodied and distant voice  
-Amy and Clara with deep listening  
- Michael with the software of speech recognition [training, hacking,...]


=== Previously ===
I move towards this direction the last years. I worked with collective writing, speech recognition, transcription, collective annotation and collective reading. It is very often in my work that I am interested in the mediation of the voice. In some of my projects I used several tools that were relating different participants in different places through the sharing of their voice. 


Larger context:
=== Relation to a larger context ===
Human-in-the-loop or HITL is defined as a model that requires human interaction. It is used in machine learning. Human-in-the-loop allows the user to change the outcome of an event or process. HITL is extremely effective for the purposes of training because it allows the trainee to immerse themselves in the event or process. 6
There are several attempts- from feminists, artists and programmers- of approaching hacking, technological cultures from a more feminist approach that involves the body and the vulnerabilities of the individual. One example is the collective Hacking With Care that include practices of taking care of their body while working as hackers. Another one is the work  r∆∆dio c∆∆rgo and Spideralex that reclaim their relationships with technologies by engaging with different collective methodologies and practices. At my point of view these attempts aim to subvert the dominant uses and relations with the technology and propose other ways of communicating.
The encoding and decoding model of communication “In contrast to other media theories that disempower audiences, Hall proposed that audience members can play an active role in decoding messages as they rely on their own social contexts, and might be capable of changing messages themselves through collective action.”2 The message can be “interpreted differently from person to person. ...decoding is the process of obtaining, absorbing, understanding, and sometimes using the information that was given throughout a verbal or non-verbal message.
In the old times telephone operators, mostly women, were working underpaid in a stressful position. They were invisible and their voices were representing a specific feminine character that was promoted by their companies, serving their clients. I think one contemporary version of this labour is the contribution of people in databases for training a machine. Though the conditions are better and different, the morals around it are under consideration. "(S)hould we be worried about the large-scale harvesting of our voiceprints? (…) The companies behind this technology say that a voiceprint includes more than 100 unique physical and behavioural characteristics of each individual " (Jones, 2018). At the other side of this machines is the effect they have on decisions regarding people’s lives and access in spaces. For example, the verification of claims of origins of refugees I previously referred to. Also, the bias spread by the old telephone companies seems to be still alive in the daily technologies. The feminine voice representing a polite servant has been used many times by the artificial intelligent machines. Examples are the GPS navigator, Amazon’s personal assistant Alexa and the one of Apple, Siri. All these seems to go on without rethinking what these systems maintain in our society<br />


As Ong observed in Orality and Literacy, the condition for a democratic social dialogue in oral cultures was the presence of people in the space and their oral speech and thus after many discussions and performative behaviours they would reach to a decision together or they would just share their individual identities and stories.
== Bibliography ==
Especially I am interested in voice because our voice is a personal and unique element very related to our bodily conditions. For oral cultures the voice was a medium to spread knowledge, on a way that differs a lot from the writing cultures, "When auditory experiences are shared, histories too are shared, and not only from mouth to ear: they are perceived by and encoded in the body through the physicality of sound waves and passed on from one generation to another."
Anadiotis, G. (2018) Human in the loop: Machine learning and AI for the people, ZDNet. Available at: https://www.zdnet.com/article/human-in-the-loop-machine-learning-and-ai-for-the-people/ (Accessed: 21 November 2018).
"The companies behind this technology say that a voiceprint includes more than 100 unique physical and behavioural characteristics of each individual, such as length of the vocal tract, nasal passage, pitch, accent and so on.


There are several attempts [from feminists, artists, programmers, sociologists] of approaching hacking, technological cultures from a more feminist approach that involves the body, the vulnerabilities of the individual [text of hacking with care].
Jones, R. (2018) ‘Voice recognition: is it really as secure as it sounds?’, The Guardian, 22 September. Available at: https://www.theguardian.com/money/2018/sep/22/voice-recognition-is-it-really-as-secure-as-it-sounds (Accessed: 11 November 2018).


Public Radio - documenta 14 (2017). Available at: https://www.documenta14.de/en/public-radio/ (Accessed: 7 November 2018).


The Economist (2018) ‘Are liberals and populists just searching for a new master?’, 8 October. Available at: https://www.economist.com/open-future/2018/10/08/are-liberals-and-populists-just-searching-for-a-new-master (Accessed: 25 October 2018).


Notes:
Welle (www.dw.com), D. (2017) Automatic speech analysis software used to verify refugees’ dialects | DW | 17.03.2017, DW.COM. Available at: https://www.dw.com/en/automatic-speech-analysis-software-used-to-verify-refugees-dialects/a-37980819 (Accessed: 25 October 2018).
1. Diagram of a general communication system from Claude E. Shannon, A Mathematical Theory of Communication, 1948.
2. https://en.wikipedia.org/wiki/Encoding/decoding_model_of_communication
3.  estranged= 1. To make hostile, unsympathetic, or indifferent; 2. To remove from an accustomed place or set of associations
4. https://underbelly.nu/product/the-phantom-of-the-operator/
5. Voice samples for training speech analysis software (LDC). Tracing the samples
6. https://en.wikipedia.org/wiki/Human-in-the-loop


Bibliography:
== Notes ==
The Economist (2018) ‘Are liberals and populists just searching for a new master?’, 8 October. Available at: https://www.economist.com/open-future/2018/10/08/are-liberals-and-populists-just-searching-for-a-new-master (Accessed: 25 October 2018).

Latest revision as of 21:36, 27 March 2019

[UPDATE: My project now is focusing on the exclusion of female voices from public sphere. My explorations regard their amplification, through their mediation]

Human always in the communication loop

My title has derived from the term Human-in-the-loop (HITL) that “is a branch of artificial intelligence that leverages both human and machine intelligence to create machine learning models” (Anadiotis, 2018).

What do I want to make?

I want to make a series of experiments involving participants that will imitate activities happening parallelly to our communication systems. One possible outcome of this would be an online database with all these practices and material that will be produced. It could work as a toolkit for me and other people that are interested to get involved. Another possible outcome is a media object that will assist collective practices like that to happen.
The diagram of Shannon Claude of 1948, shown below, presents how a general communication system works. The message is passed through a transmitter that encodes it and transfers it through a one way channel to the receiver. The latter decodes the message and the person from the other side listens to or reads it. This way of communication, that the technology of the previous century created, brought a new relationship with our bodies. One of the most intense experiences of it was the detachment of our voice from our physical bodies. Our voice being mediated through different kinds of media gets distorted, sounds unfamiliar and gets disconnected from us.

Nowadays, the new data practices of big corporations like Google and Facebook are intervening into this communication system, listening to, transcribing, documenting our voices and messages. It is commonly unknown that the most important part of making and training an intelligent ‘assistant’ like Amazon's Alexa, that is based on speech recognition tool, is the contribution of huge data of human voices. The samples used by the corporations come from different sources and communication systems around the world. They may be imitations of conversations with microphone, radio broadcasts, telephone conversations and field recordings. At the same time these tools (like the automatic dialect analysis) are used from the state of Germany to verify the claims of origin of refugees. It is very often that they can get wrong because "Identifying the region of origin for anyone based on their speech is an extremely complex task" and depends on "a wide range of factors"(Welle (www.dw.com), 2017). Google also is using voice recordings of the users from other google apps to train its speech recognition tool.
Refugee interviewed
So, based on this contemporary condition, I annotated the diagram with possible interventions of the corporations of today. Even though the outcome of data practices is a product/machine, a huge amount of human activities are related in order to train it, like transcribing, recording and speaking. The one way channel becomes a loop, where the user communicating is also ‘input’ for the machine that will provide her/his tool of communication. We are more than users using various technical mediums, we are part of the loop.
As Zizek observes, it is very difficult for the common person to understand how algorithms work “but we can easily understand how we are controlled by the digital grid” (The Economist, 2018). I want to make visible the human presence in the loop by understanding it and share it in a physical space in real time with our physical presence.

Diagram-communication-loop.JPG

How do I plan to make it?

I plan to research on data practices and databases of voices and transcriptions to find out what practices are related to it and what I can borrow from them. One example of database [1] that I find interesting is one made by an organisation collecting voice samples with various ways and from various sources and medium. Some voice samples and transcriptions from that database:


(microphone conversation)

Interview 15
(A=Interviewer; B=Interviewee)
A: So we are recording.  Awesome.  So how long have you lived in Flint? (unclear)
B: 38 years.
A: Is that your whole life?  Wow you look really young.
B: Thank you!(...)

(telephone conversation/ giving directions on spot while walking)


And this is an example of a description accompanied these voice samples that shows the way that a voice sample was made/collected: ...The number of interviewees in a single program varies from one to three, but typically, one interviewer and two interviewees appear in the program. The material includes passages of interactive dialogue, but longer stretches of monologue-like speech comprise the majority of the collected data.... This material can be used for the actions I want to make.

At the same time exercises like the deep listening sessions of Pauline Oliveros [2] can be a daily experimentation where I can find a place between our bodies and the technology of mediation. How a human transfers the message as becoming the medium? These exercises are about sharing and following collectively instructions that we get connected with the inner functions of the body, like circulation, electromagnetism and vibrations, by reading, listening and moving. My personal experience of it that connects this exercise with the mediation of the voice was that: I was listening with headphones to the instructions of Oliveros and I was repeating what she was saying. The other participants were following the instructions coming from my mouth. I felt that I became the medium that didn't have time to interpret or intervene with my own personality into the message as it was new for me. I mediated the voice of Oliveros like a telephone would do.

I want to relate the data practices with the collective exercises on a way that will activate the loop I described with the human presence. Some of the actions will involve human annotators [transcription], listening and identifying voices, decoding and encoding, donating voice samples.

The documentation of all these will resemble the databases I will find but with a different purpose than training machines.


What is my timetable?

Until the end of January: Conducting interviews with people related in collecting voice data or engaged with and re-appropriating communication technologies. I will participate in workshops or lectures related to these topics throughout the year, like CCC (Chaos Communication Congress). I will research on recordings/databases and ways of collecting voice samples for training. At the same time I will experiment with mediation in private and public spaces. I will also do more experiments on collective reading, listening, repeating, walking. I will document observations and audio, video, photos, text produced.
26/11: in my pyratechnic session with Alice I will prepare a loop process in which the students will try different actions related to the loop in my diagram.
30/11: I will try the same in Leeszaal, where people from different countries and backgrounds visit.
I will decide what the outcome will be.
February: I will relate the dynamics of the previous experiments and I will proceed with involving more people in the process. Invite them to share their personal relation to these experiments (how they perceive the mediation of their voice and their presence in the loop) and use this material in the process. Merge my process with facts related to the people involved (where is that process important or not).
March-April: Considering the documentation and outcome of my processes I will re-think my next steps and select part of the practices that have more impact.
May-June: Wrap up

Why do I want to make it?

I believe that all these procedures happening on a daily base with or without our knowledge. They have an affect on our physical bodies that we unconsciously skip. I want to create an experiential model of collective sharing of our voices based on the system I am researching on and described earlier. My purpose is to strengthen our awareness for our involvement in the loop and answer in some questions that bother me. I want to investigate these questions with the groups of people that will be involved in the process. Some of these questions is how our bodies are influenced by these systems, what is the control over them, what are the new relation with our voices and how can we appropriate these new technologies more consciously, what new material and tools these experiments produce?
Our communication and our relation with these systems have been mainly formed by big corporations or the states and their character is mostly male technical or military. It seems to me that the communication platforms are estranged[3] realities where the personal body [cultural, physical, political, gender] disappears</ref> realities difficult to understand. The context (social, cultural background, gender) that defines an individual is disappearing or manipulated into this massive automated process happening throughout our communication systems. Our data and bodies the same. I am interested especially in voice because it is a personal and unique element carrying many things from the personality. For oral cultures the voice was a medium to spread knowledge, on a way that differs a lot from the writing cultures, "When auditory experiences are shared, histories too are shared, and not only from mouth to ear: they are perceived by and encoded in the body through the physicality of sound waves and passed on from one generation to another"(Public Radio - documenta 14, 2017).
Technology becomes an extension of this desire to reach the invisible and distant, something beyond the limitations of your own body. But when I talk about detachment I don’t mean it in a negative sense; there is first an alienation and frustration and distancing, but if we understand it with our body we understand this communication, so it is a way of understanding media through simple techniques as, for instance, just ‘repeating’ a youtube video or transcribing the voice of our interlocutor.

Who can help me and how?

People I interview like Reni Hofmüller can help me on experimenting with the specific technology of mediation and imagining other aspects of them. Also, Raadio Caargo can help me imagining potential futures of the mediums by engaging with feminist methodologies and practices. Joana, a former student can help me with prototyping, references and discussion on embodied and distant voice. My tutors Amy and Clara with deep listening exercises. My XPUB colleagues with participating in my first experiments.

Previously

I move towards this direction the last years. I worked with collective writing, speech recognition, transcription, collective annotation and collective reading. It is very often in my work that I am interested in the mediation of the voice. In some of my projects I used several tools that were relating different participants in different places through the sharing of their voice.

Relation to a larger context

There are several attempts- from feminists, artists and programmers- of approaching hacking, technological cultures from a more feminist approach that involves the body and the vulnerabilities of the individual. One example is the collective Hacking With Care that include practices of taking care of their body while working as hackers. Another one is the work r∆∆dio c∆∆rgo and Spideralex that reclaim their relationships with technologies by engaging with different collective methodologies and practices. At my point of view these attempts aim to subvert the dominant uses and relations with the technology and propose other ways of communicating. In the old times telephone operators, mostly women, were working underpaid in a stressful position. They were invisible and their voices were representing a specific feminine character that was promoted by their companies, serving their clients. I think one contemporary version of this labour is the contribution of people in databases for training a machine. Though the conditions are better and different, the morals around it are under consideration. "(S)hould we be worried about the large-scale harvesting of our voiceprints? (…) The companies behind this technology say that a voiceprint includes more than 100 unique physical and behavioural characteristics of each individual " (Jones, 2018). At the other side of this machines is the effect they have on decisions regarding people’s lives and access in spaces. For example, the verification of claims of origins of refugees I previously referred to. Also, the bias spread by the old telephone companies seems to be still alive in the daily technologies. The feminine voice representing a polite servant has been used many times by the artificial intelligent machines. Examples are the GPS navigator, Amazon’s personal assistant Alexa and the one of Apple, Siri. All these seems to go on without rethinking what these systems maintain in our society

Bibliography

Anadiotis, G. (2018) Human in the loop: Machine learning and AI for the people, ZDNet. Available at: https://www.zdnet.com/article/human-in-the-loop-machine-learning-and-ai-for-the-people/ (Accessed: 21 November 2018).

Jones, R. (2018) ‘Voice recognition: is it really as secure as it sounds?’, The Guardian, 22 September. Available at: https://www.theguardian.com/money/2018/sep/22/voice-recognition-is-it-really-as-secure-as-it-sounds (Accessed: 11 November 2018).

Public Radio - documenta 14 (2017). Available at: https://www.documenta14.de/en/public-radio/ (Accessed: 7 November 2018).

The Economist (2018) ‘Are liberals and populists just searching for a new master?’, 8 October. Available at: https://www.economist.com/open-future/2018/10/08/are-liberals-and-populists-just-searching-for-a-new-master (Accessed: 25 October 2018).

Welle (www.dw.com), D. (2017) Automatic speech analysis software used to verify refugees’ dialects | DW | 17.03.2017, DW.COM. Available at: https://www.dw.com/en/automatic-speech-analysis-software-used-to-verify-refugees-dialects/a-37980819 (Accessed: 25 October 2018).

Notes

  1. https://catalog.ldc.upenn.edu/search
  2. http://deeplistening.org/site/content/workshops
  3. estranged= 1. To make hostile, unsympathetic, or indifferent; 2. To remove from an accustomed place or set of associations