High-performance bilingual text alignment using statistical and dictionary information

MASAHIKO HARUNO; TAKEFUMI YAMAZAKI

doi:10.1017/S1351324997001459

High-performance bilingual text alignment using statistical and dictionary information

Published online by Cambridge University Press: 01 March 1997

MASAHIKO HARUNO and

TAKEFUMI YAMAZAKI

Show author details

MASAHIKO HARUNO: Affiliation:
NTT Communication Science Laboratories, 1-1 Hikarinooka Yokosuka Kanagawa 239, Japan
TAKEFUMI YAMAZAKI: Affiliation:
NTT Communication Science Laboratories, 1-1 Hikarinooka Yokosuka Kanagawa 239, Japan

Article contents

Abstract

Get access

Rights & Permissions

Abstract

This paper describes an accurate and robust text alignment system for structurally different languages. Among structurally different languages such as Japanese and English, there is a limitation on the amount of word correspondences that can be statistically acquired. The main reason for this is the systems of functional (closed) words are quite different in the two languages. The proposed method makes use of two kinds of word correspondences in aligning bilingual texts. One is a bilingual dictionary of general use. The other is the word correspondences that are statistically acquired in the alignment process. Our method gradually determines sentence pairs (anchors) that correspond to each other by relaxing parameters. The method, by combining two kinds of word correspondences, achieves adequate word correspondences for complete alignment. As a result, texts of various length and of various genres in structurally different languages can be aligned with high precision. Experimental results show our system outperforms conventional methods for various kinds of Japanese–English texts.

Type: Research Article
Information: Natural Language Engineering , Volume 3 , Issue 1 , March 1997 , pp. 1 - 14

DOI: https://doi.org/10.1017/S1351324997001459 [Opens in a new window]

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article contents

High-performance bilingual text alignment using statistical and dictionary information

Abstract

Access options

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests