Year of publication: 2010
Author: Ventsislav Zhechev
Publisher: LAP Lambert Academic Publishing
Pages: 148
ISBN: 9783838327952
Description
The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data-oriented applications. This work is targeted at the developers and users of Machine Translation technology. It introduces a novel open-source platform for the fast and robust automatic generation of parallel treebanks through sub-tree alignment, using a limited amount of external resources. The intrinsic and extrinsic evaluations that were undertaken demonstrate that this system is a feasible alternative to the manual annotation of parallel treebanks. Therefore, the presented platform is expected to help boost research in the field of syntax-augmented machine translation and lead to advancements in other fields where parallel treebanks can be employed.