Please wait a minute...
Front. Inform. Technol. Electron. Eng.  2010, Vol. 11 Issue (9): 663-676    DOI: 10.1631/jzus.C1000104
    
Using an integrated feature set to generalize and justify the Chinese-to-English transferring rule of the ‘ZHE’ aspect
Yun-hua Qu*,1, Tian-jiong Tao2, Serge Sharoff3, Narisong Jin4, Ruo-yuan Gao5, Nan Zhang6, Yu-ting Yang7, Cheng-zhi Xu8
1 School of International Studies, Zhejiang University, Hangzhou 310058, China 2 Institute of Fluid Physics, China Academy of Engineering Physics, Mianyang 621900, China 3 Center for Translation Studies, University of Leeds, Leeds LS2 9JT, UK 4 Center for Applied Linguistics, Hangzhou Normal University, Hangzhou 310012, China 5 School of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China 6 Sensor Network and Application Research Center, Graduate School, Chinese Academy of Sciences, Beijing 100049, China 7 School of Economics, Zhejiang University, Hangzhou 310027, China 8 Zhejiang University City College, Hangzhou 310015, China
Using an integrated feature set to generalize and justify the Chinese-to-English transferring rule of the ‘ZHE’ aspect
Yun-hua Qu*,1, Tian-jiong Tao2, Serge Sharoff3, Narisong Jin4, Ruo-yuan Gao5, Nan Zhang6, Yu-ting Yang7, Cheng-zhi Xu8
1 School of International Studies, Zhejiang University, Hangzhou 310058, China 2 Institute of Fluid Physics, China Academy of Engineering Physics, Mianyang 621900, China 3 Center for Translation Studies, University of Leeds, Leeds LS2 9JT, UK 4 Center for Applied Linguistics, Hangzhou Normal University, Hangzhou 310012, China 5 School of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China 6 Sensor Network and Application Research Center, Graduate School, Chinese Academy of Sciences, Beijing 100049, China 7 School of Economics, Zhejiang University, Hangzhou 310027, China 8 Zhejiang University City College, Hangzhou 310015, China
 全文: PDF 
摘要: In machine translation (MT) practice, there is an urgent need for constructing a set of Chinese-to-English aspect transferring rules to define the transferring conditions. The integrated feature set was used to generalize and justify the Chinese-to-English transferring rule of the ‘ZHE’ aspect (ZHE Rule). A ZHE classification model was built in this study. The impacts of each set of temporal, lexical aspectual, and syntactic features, and their integrated impacts, on the accuracy of the ZHE Rule were tested. Over 600 misclassified corpus sentences were manually examined. A 10-fold cross-validation was used with a decision tree algorithm. The main results are: (1) The ZHE Rule was generalized and justified to have a higher accuracy under the two metrics: the precision rate and the areas under the receiver operating characteristic curve (AUC). (2) The temporal, lexical aspectual, and syntactic feature sets have an integrated contribution to the accuracy of the ZHE Rule. The syntactic and temporal features have an impact on ZHE aspect derivations, while the lexical aspectual features are not predictive of ZHE aspect derivation. (3) While associated with active verbs, the ZHE aspect can denote a perfective situation. This study suggests that the temporal and syntactic features are the predictive ZHE aspect classification features and that the ZHE Rule with an overall precision rate of 80.1% is accurate enough to be further explored in MT practice. The machine learning method, decision tree, can be applied to the automatic aspect transferring in MT research and aspectual interpretations in linguistic research.
关键词: ZHE aspect transferring rule (ZHE Rule)Machine learningDecision treeAspect classificationIntegrated feature set    
Abstract: In machine translation (MT) practice, there is an urgent need for constructing a set of Chinese-to-English aspect transferring rules to define the transferring conditions. The integrated feature set was used to generalize and justify the Chinese-to-English transferring rule of the ‘ZHE’ aspect (ZHE Rule). A ZHE classification model was built in this study. The impacts of each set of temporal, lexical aspectual, and syntactic features, and their integrated impacts, on the accuracy of the ZHE Rule were tested. Over 600 misclassified corpus sentences were manually examined. A 10-fold cross-validation was used with a decision tree algorithm. The main results are: (1) The ZHE Rule was generalized and justified to have a higher accuracy under the two metrics: the precision rate and the areas under the receiver operating characteristic curve (AUC). (2) The temporal, lexical aspectual, and syntactic feature sets have an integrated contribution to the accuracy of the ZHE Rule. The syntactic and temporal features have an impact on ZHE aspect derivations, while the lexical aspectual features are not predictive of ZHE aspect derivation. (3) While associated with active verbs, the ZHE aspect can denote a perfective situation. This study suggests that the temporal and syntactic features are the predictive ZHE aspect classification features and that the ZHE Rule with an overall precision rate of 80.1% is accurate enough to be further explored in MT practice. The machine learning method, decision tree, can be applied to the automatic aspect transferring in MT research and aspectual interpretations in linguistic research.
Key words: ZHE aspect transferring rule (ZHE Rule)    Machine learning    Decision tree    Aspect classification    Integrated feature set
收稿日期: 2010-04-18 出版日期: 2010-09-07
CLC:  TP391.1  
基金资助: Project  supported  by  the  National  Social  Science  Foundation  of China  (No.  08BYY001)  and  the  Worldwide  Universities  Network
2009 Research Mobility Programme
通讯作者: Yun-hua QU     E-mail: qu163hua@163.com
服务  
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章  
Yun-hua Qu
Tian-jiong Tao
Serge Sharoff
Narisong Jin
Ruo-yuan Gao
Nan Zhang
Yu-ting Yang
Cheng-zhi Xu

引用本文:

Yun-hua Qu, Tian-jiong Tao, Serge Sharoff, Narisong Jin, Ruo-yuan Gao, Nan Zhang, Yu-ting Yang, Cheng-zhi Xu. Using an integrated feature set to generalize and justify the Chinese-to-English transferring rule of the ‘ZHE’ aspect. Front. Inform. Technol. Electron. Eng., 2010, 11(9): 663-676.

链接本文:

http://www.zjujournals.com/xueshu/fitee/CN/10.1631/jzus.C1000104        http://www.zjujournals.com/xueshu/fitee/CN/Y2010/V11/I9/663

[1] Yi LIN, Jian-wei ZHANG, Hong LIU. An algorithm for trajectory prediction of flight plan based on relative motion between positions[J]. Front. Inform. Technol. Electron. Eng., 2018, 19(7): 905-916.
[2] Rabia IRFAN , Sharifullah KHAN, Kashif RAJPOOT, Ali Mustafa QAMAR. TIE algorithm: a layer over clustering-based taxonomy generation for handling evolving data[J]. Front. Inform. Technol. Electron. Eng., 2018, 19(6): 763-782.
[3] Ahmad FIRDAUS, Nor Badrul ANUAR, Ahmad KARIM, Mohd Faizal Ab RAZAK. Discovering optimal features using static analysis and a genetic search based method for Android malware detection[J]. Front. Inform. Technol. Electron. Eng., 2018, 19(6): 712-736.
[4] Liu LIU , Bao-sheng WANG, Bo YU, Qiu-xi ZHONG. Automatic malware classification and new malware detection using machine learning[J]. Front. Inform. Technol. Electron. Eng., 2017, 18(9): 1336-1347.
[5] Lei-lei KONG, Zhi-mao LU, Hao-liang QI, Zhong-yuan HAN. A machine learning approach to query generation in plagiarism source retrieval[J]. Front. Inform. Technol. Electron. Eng., 2017, 18(10): 1556-1572.
[6] Zhang Liang, Jun Xiao, Yue-ting Zhuang. Synthesizing style-preserving cartoons via non-negative style factorization[J]. Front. Inform. Technol. Electron. Eng., 2012, 13(3): 196-207.
[7] Xiao-lei Ma, Yin-hai Wang, Feng Chen, Jian-feng Liu. Transit smart card data mining for passenger origin information extraction[J]. Front. Inform. Technol. Electron. Eng., 2012, 13(10): 750-760.