Compo: composite motif discovery using discrete models

dc.contributor.author	Sandve, Geir K
dc.contributor.author	Abul, Osman
dc.contributor.author	Drabløs, Finn
dc.date.accessioned	2015-10-09T01:04:29Z
dc.date.available	2015-10-09T01:04:29Z
dc.date.issued	2008
dc.identifier.citation	BMC Bioinformatics. 2008 Dec 08;9(1):527
dc.identifier.uri	http://hdl.handle.net/10852/46385
dc.description.abstract	Background Computational discovery of motifs in biomolecular sequences is an established field, with applications both in the discovery of functional sites in proteins and regulatory sites in DNA. In recent years there has been increased attention towards the discovery of composite motifs, typically occurring in cis-regulatory regions of genes. Results This paper describes Compo: a discrete approach to composite motif discovery that supports richer modeling of composite motifs and a more realistic background model compared to previous methods. Furthermore, multiple parameter and threshold settings are tested automatically, and the most interesting motifs across settings are selected. This avoids reliance on single hard thresholds, which has been a weakness of previous discrete methods. Comparison of motifs across parameter settings is made possible by the use of p-values as a general significance measure. Compo can either return an ordered list of motifs, ranked according to the general significance measure, or a Pareto front corresponding to a multi-objective evaluation on sensitivity, specificity and spatial clustering. Conclusion Compo performs very competitively compared to several existing methods on a collection of benchmark data sets. These benchmarks include a recently published, large benchmark suite where the use of support across sequences allows Compo to correctly identify binding sites even when the relevant PWMs are mixed with a large number of noise PWMs. Furthermore, the possibility of parameter-free running offers high usability, the support for multi-objective evaluation allows a rich view of potential regulators, and the discrete model allows flexibility in modeling and interpretation of motifs.
dc.language.iso	eng
dc.rights	Sandve et al.
dc.rights	Attribution 2.0 Generic
dc.rights.uri	http://creativecommons.org/licenses/by/2.0/
dc.title	Compo: composite motif discovery using discrete models
dc.type	Journal article
dc.date.updated	2015-10-09T01:04:30Z
dc.creator.author	Sandve, Geir K
dc.creator.author	Abul, Osman
dc.creator.author	Drabløs, Finn
dc.identifier.doi	http://dx.doi.org/10.1186/1471-2105-9-527
dc.identifier.urn	URN:NBN:no-50520
dc.type.document	Tidsskriftartikkel
dc.type.peerreviewed	Peer reviewed
dc.identifier.fulltext	Fulltext https://www.duo.uio.no/bitstream/handle/10852/46385/1/12859_2008_Article_2512.pdf
dc.type.version	PublishedVersion
cristin.articleid	527