Please wait a minute...
Front. Inform. Technol. Electron. Eng.  2011, Vol. 12 Issue (4): 263-272    DOI: 10.1631/jzus.C1000091
    
Structural visualization of sequential DNA data
Xiao-hong Mao1, Jing-hua Fu2, Wei Chen*,2, Qian You3, Shiao-fen Fang3, Qun-sheng Peng2
1 The Second Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310013, China 2 State Key Lab of CAD & CG, Zhejiang University, Hangzhou 310058, China 3 Department of Computer and Information Science, Indiana University-Purdue University Indianapolis (IUPUI), Indianapolis, IN 46202, USA
Download:   PDF(1354KB)
Export: BibTeX | EndNote (RIS)      

Abstract  To date, comparing and visualizing genome sequences remain challenging due to the large genome size. Existing approaches take advantage of the stable property of oligonucleotides and exhibit the main characteristics of the whole genome, yet they commonly fail to show progression patterns of the genome adjustably. This paper presents a novel visual encoding technique, which not only supports the binning process (phylogenetic analysis), but also allows the sequential analysis of the genome. The key idea is to regard the combination of each k-nucleotide and its reverse complement as a visual word, and to represent a long genome sequence with a list of local statistical feature vectors derived from the local frequency of the visual words. Experimental results on a variety of examples demonstrate that the presented approach has the ability to quickly and intuitively visualize DNA sequences, and to help the user identify regions of differences among multiple datasets.

Key wordsGenome sequence      Sequential visualization      Bio-information visualization     
Received: 11 April 2010      Published: 11 April 2011
CLC:  TP391.1  
  R394.3  
Cite this article:

Xiao-hong Mao, Jing-hua Fu, Wei Chen, Qian You, Shiao-fen Fang, Qun-sheng Peng. Structural visualization of sequential DNA data. Front. Inform. Technol. Electron. Eng., 2011, 12(4): 263-272.

URL:

http://www.zjujournals.com/xueshu/fitee/10.1631/jzus.C1000091     OR     http://www.zjujournals.com/xueshu/fitee/Y2011/V12/I4/263


Structural visualization of sequential DNA data

To date, comparing and visualizing genome sequences remain challenging due to the large genome size. Existing approaches take advantage of the stable property of oligonucleotides and exhibit the main characteristics of the whole genome, yet they commonly fail to show progression patterns of the genome adjustably. This paper presents a novel visual encoding technique, which not only supports the binning process (phylogenetic analysis), but also allows the sequential analysis of the genome. The key idea is to regard the combination of each k-nucleotide and its reverse complement as a visual word, and to represent a long genome sequence with a list of local statistical feature vectors derived from the local frequency of the visual words. Experimental results on a variety of examples demonstrate that the presented approach has the ability to quickly and intuitively visualize DNA sequences, and to help the user identify regions of differences among multiple datasets.

关键词: Genome sequence,  Sequential visualization,  Bio-information visualization 
[1] Hui Chen, Bao-gang Wei, Yi-ming Li, Yong-huai Liu, Wen-hao Zhu. An easy-to-use evaluation framework for benchmarking entity recognition and disambiguation systems[J]. Front. Inform. Technol. Electron. Eng., 2017, 18(2): 195-205.
[2] Xi-ming Li, Ji-hong Ouyang, You Lu. Topic modeling for large-scale text data[J]. Front. Inform. Technol. Electron. Eng., 2015, 16(6): 457-465.
[3] Jin-song Su, Xiao-dong Shi, Yan-zhou Huang, Yang Liu, Qing-qiang Wu, Yi-dong Chen, Huai-lin Dong. Topic-aware pivot language approach for statistical machine translation[J]. Front. Inform. Technol. Electron. Eng., 2014, 15(4): 241-253.
[4] Yun-hua Qu, Tian-jiong Tao, Serge Sharoff, Narisong Jin, Ruo-yuan Gao, Nan Zhang, Yu-ting Yang, Cheng-zhi Xu. Using an integrated feature set to generalize and justify the Chinese-to-English transferring rule of the ‘ZHE’ aspect[J]. Front. Inform. Technol. Electron. Eng., 2010, 11(9): 663-676.