Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: New Methods in Bioinformatics Presented at the Fifth International Conference on Bioinformatics of Genome Regulation and Structure
Guest editors: Evgenii Vityaevx and Nikolay Kolchanovy
Article type: Research Article
Authors: Levitsky, Victora; b; * | Ignatieva, Elenaa; b | Aman, Eugeniaa | Merkulova, Tatyanaa; b | Kolchanov, Nikolaya; b | Hodgman, Charlesc
Affiliations: [a] Laboratory of Theoretical Genetics, Institute of Cytology and Genetics, Novosibirsk, 630090, Russia | [b] Department of Natural Science, Novosibirsk State University, Novosibirsk, 630090, Russia | [c] Multidisciplinary Centre for Integrative Biology, School of Biosciences, University of Nottingham, Sutton Bonington, LE12 5RD, UK | [x] Sobolev Institute of Mathematics, Koptyug aven. 4, Novosibirsk, 630090, Russia | [y] Laboratory of Theoretical Genetics, Institute of Cytology and Genetics, Novosibirsk, 630090, Russia
Correspondence: [*] Corresponding author. Tel.: +7 383 3333119; Fax: +7 383 3331278; E-mail: levitsky@bionet.nsc.ru.
Abstract: Development of reliable transcription factor binding site (TFBS) recognition methods is an important step in the large-scale genome analysis. The most of currently applied methods to predict functional TFBSs are hampered by the high false-positive rates that occur when too few functionally characterised sequences are available and only sequence conservation within a site core is considered. We propose two methods to search for binding sites (BSs) of peroxisome proliferator-activated receptor (PPAR) (peroxisome proliferator response elements, PPREs). The first method is the optimized dinucleotide position weight matrix (PWM) model, the second method represented by SiteGA model that used genetic algorithm with a discriminant function of locally positioned dinucleotides to infer the most important positions and dinucleotides. We used in our analysis two PPRE datasets, consisting of 37 and 98 BSs, correspondingly. We showed that dataset extension improved the accuracy of SiteGA, but not PWM model. Finally we combined both models (PWM and SiteGA) to the dataset of annotated human promoters (EPD). We demonstrated that the larger dataset and the longer window length supported notable growth of accuracies for PWM and SiteGA models. Consequently, a combined PWM and SiteGA application may better restrict the number of potential targets in the EPD promoter dataset.
Keywords: Transcription factor binding site recognition, peroxisome proliferator-activated receptor, peroxisome proliferator response elements, discriminant analysis, genetic algorithm, position weight matrix
DOI: 10.3233/IDA-2008-12506
Journal: Intelligent Data Analysis, vol. 12, no. 5, pp. 513-526, 2008
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl