Anton Tsukanov1, Victor Levitsky2, Tatyana Merkulova3
1ICG SB RAS, Novosibirsk, Russia, tsukanov@bionet.nsc.ru
2ICG SB RAS, Novosibirsk, Russia, levitsky@bionet.nsc.ru
3ICG SB RAS, Novosibirsk, Russia, merkulova@bionet.nsc.ru

We developed pipeline for integrative application of various de novo motif search tools to massive sequencing data. While traditional position weight matrices (PWMs) neglect dependencies between positions of motifs, the ‘short-range interactions’ markov models BAMM/InMode permit only local dependencies, the ‘long-range interaction’ model SiteGA allows dependencies between arbitrary positions. The massive analysis of ChIP-seq data have shown that the models BAMM/InMode and PWMs recognized similar and significantly overlapped peaks; but the notable fractions of peaks predicted by SiteGA model were not predicted by other models.