Spoken to Spoken vs. Spoken to Written: Corpus Approach to Exploring Interpreting and Subtitling

Title	Spoken to Spoken vs. Spoken to Written: Corpus Approach to Exploring Interpreting and Subtitling
Publication Type	Conference Paper
Year of Publication	2010
Authors	Mikhailov M, Tommola H, Isolahti N
Conference Name	POLIBITS
Keywords	corpora, interpreting, subtitling
Abstract	The need for corpora of interpreting discourse in translation studies is gradually increasing. The research of AV translation is another rapidly developing sphere, thus corpora of subtitling and dubbing would also be quite useful. The main reason of the lack in such resources is the difficulty of obtaining data and the inevitability of manual data input. An interpreting corpus would be a collection of transcripts of speech in two or more languages with part of the transcripts aligned. The subtitling and dubbing corpora can be designed using the same principles. The structure of the corpus should reflect the polyphonic nature of the data. Thus, markup becomes extremely important in these types of corpora. The research presented in this paper deals with corpora of Finnish-Russian interpreting discourse and subtitling. The software package developed for processing of the corpora includes routines specially written for studying speech transcripts rather than written text. For example, speaker statistics function calculates number of words, number of pauses, their duration, average speech tempo of a certain speaker.
URL	http://www.mtc.cidetec.ipn.mx/POLIBITS/2010_41/p41.pdf

You are here