Using a Parallel Transcript/Subtitle Corpus for Sentence Compression
Vandeghinste,V. ; Tjong Kim Sang,E.F.
Vandeghinste,V.
Tjong Kim Sang,E.F.
Abstract
In this paper we describe the collection of a parallel corpus (in Dutch) and its use in a sentence compression tool with the intention to automatically generate subtitles for the deaf from transcripts of a television program. First, the collection of the corpus is described, together with the manipulations and transformations performed on that corpus. Second, a hybrid sentence compression tool is described together with its evaluation.
Description
Pagination: 4
Date
2004
Journal Title
Journal ISSN
Volume Title
Publisher
Unknown Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
Citation
Vandeghinste, V & Tjong Kim Sang, E F 2004, Using a Parallel Transcript/Subtitle Corpus for Sentence Compression. in Proceedings of the 4th International Language Resources and Evaluation Conference (LREC 2004). Unknown Publisher, Lisbon, pp. 231-234.
