Silvestre Cerdà, Joan Albert; Garcia Martinez, Maria Mercedes; Barrón Cedeño, Luis Alberto; Civera Saiz, Jorge; Rosso ., Paolo(CEUR Workshop Proceedings, 2011)
[EN] This paper presents a proposal for extracting parallel corpora from Wikipedia on the basis of statistical machine translation techniques. We have used
word-level alignment models from IBM in order to obtain phrase-level ...