HCV quasispecies assembly using network flows

Kelly Westbrooks, Irina Astrovskaya, David Campo, Yury Khudyakov, Piotr Berman, Alex Zelikovsky

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution

    28 Citations (Scopus)

    Abstract

    Understanding how the genomes of viruses mutate and evolve within infected individuals is critically important in epidemiology. By exploiting knowledge of the forces that guide viral microevolution, researchers can design drugs and treatments that are effective against newly evolved strains. Therefore, it is critical to develop a method for typing the genomes of all of the variants of a virus (quasispecies) inside an infected individual cell. In this paper, we focus on sequence assembly of Hepatitis C Virus (HCV) based on the 454 Life Sciences system, which produces around 250K reads, each 100-400 bases long. We introduce several formulations of the quasispecies assembly problem and a measure of assembly quality. We also propose a scalable assembly method for quasispecies based on a novel network flow formulation. Finally, we report the results of assembling 44 quasispecies from the 1700-bp-long E1E2 region of HCV.

    Original language: English (US)
    Title of host publication: Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings
    Pages: 159-170
    Number of pages: 12
    DOIs: https://doi.org/10.1007/978-3-540-79450-9_15
    State: Published - Aug 27 2008
    Event: 4th International Symposium on Bioinformatics Research and Applications, ISBRA 2008 - Atlanta, GA, United States
    Duration: May 6 2008 to May 9 2008

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 4983 LNBI
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Other

    Other: 4th International Symposium on Bioinformatics Research and Applications, ISBRA 2008
    Country: United States
    City: Atlanta, GA
    Period: 5/6/08 to 5/9/08

    Fingerprint

    Quasispecies
    Network Flow
    Viruses
    Virus
    Genome
    Genes
    Epidemiology
    Drug Design
    Formulation
    Cell

    All Science Journal Classification (ASJC) codes

    • Theoretical Computer Science
    • Computer Science (all)

    Cite this

    Westbrooks, K., Astrovskaya, I., Campo, D., Khudyakov, Y., Berman, P., & Zelikovsky, A. (2008). HCV quasispecies assembly using network flows. In Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings (pp. 159-170). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4983 LNBI). https://doi.org/10.1007/978-3-540-79450-9_15
    Westbrooks, Kelly ; Astrovskaya, Irina ; Campo, David ; Khudyakov, Yury ; Berman, Piotr ; Zelikovsky, Alex. / HCV quasispecies assembly using network flows. Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings. 2008. pp. 159-170 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
    @inproceedings{4aca838bb971433499ee42ce2682d654,
    title = "HCV quasispecies assembly using network flows",
    author = "Kelly Westbrooks and Irina Astrovskaya and David Campo and Yury Khudyakov and Piotr Berman and Alex Zelikovsky",
    year = "2008",
    month = "8",
    day = "27",
    doi = "10.1007/978-3-540-79450-9_15",
    language = "English (US)",
    isbn = "3540794492",
    series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
    pages = "159--170",
    booktitle = "Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings",

    }

    Westbrooks, K, Astrovskaya, I, Campo, D, Khudyakov, Y, Berman, P & Zelikovsky, A 2008, HCV quasispecies assembly using network flows. in Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4983 LNBI, pp. 159-170, 4th International Symposium on Bioinformatics Research and Applications, ISBRA 2008, Atlanta, GA, United States, 5/6/08. https://doi.org/10.1007/978-3-540-79450-9_15

    TY - GEN

    T1 - HCV quasispecies assembly using network flows

    AU - Westbrooks, Kelly

    AU - Astrovskaya, Irina

    AU - Campo, David

    AU - Khudyakov, Yury

    AU - Berman, Piotr

    AU - Zelikovsky, Alex

    PY - 2008/8/27

    Y1 - 2008/8/27

    UR - http://www.scopus.com/inward/record.url?scp=49949112775&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=49949112775&partnerID=8YFLogxK

    U2 - 10.1007/978-3-540-79450-9_15

    DO - 10.1007/978-3-540-79450-9_15

    M3 - Conference contribution

    AN - SCOPUS:49949112775

    SN - 3540794492

    SN - 9783540794493

    T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

    SP - 159

    EP - 170

    BT - Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings

    ER -

    Westbrooks K, Astrovskaya I, Campo D, Khudyakov Y, Berman P, Zelikovsky A. HCV quasispecies assembly using network flows. In Bioinformatics Research and Applications - Fourth International Symposium, ISBRA 2008, Proceedings. 2008. p. 159-170. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-540-79450-9_15