RSS 2.0
  • Home
  • About
  • Aligners
  • Genomes
  • VarScan
  •  

    AGBT: PacBio Somewhat Unveiled

    February 27th, 2010

    Yesterday the Pacific Biosciences commercial instrument (photo) was at last unveiled to a packed room of conference attendees. The road to this third generation sequencer’s release has been paved with nearly $300 million of investment capital since leaving a basement at Cornell University. PacBio, in addition to becoming something of a media darling, has quietly swelled to a several-hundred-employee company.

    Since last year, PacBio claims to have achieved read lengths of up to 10.3 kbp, although I haven’t spoken to anyone outside the company who has seen reads that long. Even so, a few vignettes presented in the workshop told of how PacBio has been applied to influenza strain identification and detection of stuctural variants (SVs).

    Strobe Sequencing in Real Time

    Of particular interest is the “strobe sequencing” mode of the instrument, in which the detection laser is turned off for precise amounts of time to generate mate-pair-like reads spanning large fragments. This feature relies on the real time sequencing, which occurs at a very consistent per-base rate. In fact, it’s possible to infer sequence insertions and deletions as spikes or dips (respectively) in the time required to sequence a template of known size.

    Kinetic Variation Applications

    The kinetics of real-time sequencing offer an informative new dimension of information from the PacBio data. In a talk today, Eric Schadt of PacBio showed that the kinetics of sequencing vary significantly for “modified” bases, i.e. methylated residues. In a collaboration with Carrie Harwood (UW), PacBio is sequencing the genomes and transcriptomes of 132 isolates of a hydrogen-producing species of Rhodopseudomonas. It turned out that kinetic variation exists at many bases as a “mixture” of sequencing times; by mining these, they identified thousands of methylated bases that caused up to 12-fold variation in sequencing kinetics.

    Burning Questions Unanswered

    Personally, I was not entirely satisfied with the PacBio workshop. When it opened for questions, I asked the first: whether PacBio had improved any upon the “dark bases” that go by undetected in single molecule sequencing. The presenter — Stephen Turner of PacBio — first gave me a nice 2-minute lecture on why there are no such thing as “dark bases” on PacBio’s sequencing platform due to its inherent awesomeness (sarcasm mine). There is still a problem with “missed bases” but Turner was  almost comically evasive (as Daniel MacArthur put it) in stating how often they occur. The next question concerned read lengths, a second topic on which Turner refused to provide concrete information.

    Thus, I find myself cautious in my excitement about this new platform, and will reserve judgment until later this year, when the first of the golden-ticket early access partners begin generating data on their own PacBio SMRT sequencers.

    Read More About PacBio at AGBT:

    Daniel MacArthur at GeneticFuture

    Kevin Davies at Bio-IT World

    AddThis Social Bookmark Button

    AGBT: Cancer Genomics at St. Judes, Harvard, WashU

    February 26th, 2010

    Today’s plenary session included some great talks on cancer genomics. Keynote speaker Jim Downing of St. Jude Children’s Research Hospital gave a talk on acute leukemia, in which he openly admitted that he would show no next-gen sequencing data. Instead, he gave a very nice overview of the four biological processes that are dysregulated in acute leukemia:

    1. Self renewal. With a few exceptions, pre-leukemic cells have only limited self-renewal capacity. AML1-ETO is often altered to overcome this limitation.
    2. Response to growth factor signals. The BCR-ABL gene fusion is a classic example of an alteration that lets cells grow in the absence of growth factors.
    3. Differentiation. Leukemic cells block this process via alterations in PML-RARA, PAX5, EBF, BTLA, and others).
    4. Apoptosis. This normal pathway of cell death is circumvented in leukemia via alterations in CDKN2A/B, BT6, and the RB pathway.

    Non-NGS Molecular Profiling

    Dr. Downing’s group uses several molecular techniques to characterize pediatric leukemias, including Affy SNP-chip (for copy number alterations), cytogenetics/FISH, and targeted sequencing in a handful of genes. In a study of 242 pediatric acute lymphoblastic leukemia (ALL) tumors with matched controls, a surprisingly small number of copy number alterations were observed.

    There were a few significantly altered genes, however. PAX5 was deleted or amplified in 30% of B-cell ALLs; some apparent 3′ deletions proved to be fusion events with ETV6, FOXP1, or other genes. Another gene, IK2F1, was deleted in 83.7% of ALLs that were BCR-ABL positive. These and other findings convinced the audience, I think, that there is much to be learned, even about the best-characterized human cancer, and even without next-generation sequencing technologies.

    Cancer Genomes and Translational Oncology

    Levi Garraway of Harvard Medical School spoke about how next-generation sequencing can be applied to translational oncology. He offered a clinical perspective to cancer genomics, which has somewhat different requirements from basic research:

    • Targeted. The mutations and genes to be assessed in clinical samples must already be known and well-characterized.
    • Resource-efficient. To minimize costs, clinicians are interested in tests that make efficient use of sample and equipment resources.
    • Actionable. Only mutations and biomarkers that give actionable information, i.e., “the patient has X mutation, so we should administer drug Y” are valuable in a clinical setting.

    A resource compiled by Dr. Garraway and others, called OncoMap, offers a database of known oncogenic mutations that can be tested (on frozen or FFPE samples) for just $200 per patient. Granted, it includes only 46 mutations from 34 cancer genes, but each provides a validated, actionable course in regard to treatment.

    The speaker admitted that ideally, a systematic mutational profiling method would have high sensitivity and specificity, testing both oncogenes and tumor suppressors. It would also detect multiple alteration types (SNVs, CNAs, etc) and be able to use either DNA or RNA, or both. And it would have an “acceptable” turnaround time, say 2 weeks. This is what clinicians want, and it may be that hybrid capture approaches may offer the best solution. More on that in another post.

    Elaine Mardis: Single Molecule Sequencing in Cancer

    My favorite talk of the day (obviously) was by genome center co-director Est-judes-cancer-projectlaine Mardis, who presented WashU’s pipeline for detecting and validating somatic mutations from whole-genome sequencing. Our pipeline has evolved over the course of AML1, AML2, and other cancer whole-genome sequencing projects, and now has the highly automated capacity to handle the coming 600 tumor-normal pairs to be sequenced for the Pediatric Cancer Genome Project (PCGP).

    Dr. Mardis also discussed our methods for systematically assessing the prevalence of somatic mutations (within a tumor population) as well as their recurrence in tumors of the same or other types. Prevalence is important because the greater fraction of tumor cells that share a mutation, the more likely it occurred early during progression. By similar reasoning, assessing the recurrence of mutations in a tumor type provides a measure of their importance for disease development.

    The Importance of Recurrence Testing

    IDH1 demonstrates this principle well.  Initially identified as a key cancer gene in glioma by Bert Vogelstein’s group at Johns Hopkins, the isocitrate dehydrogenase 1 (IDH1) gene was also mutated in AML2, and, in a screen of hundreds of AML samples, proved to be recurrent. At least two large-scale studies of AML have since replicated the common incidence of IDH1 mutations in AML and other cancers.

    Third Generation Sequencing in Cancer

    Finally, the speaker presented some recent experiments that we’ve performed using the Pacific Biosystems Single Molecule Real Time sequencer on in-house cancer samples. In work that’s part of a manuscript in submission, the accuracy and sensitivity of the SMRT sequencer were assessed on GBM and AML tumor samples that had already been characterized by whole genome sequencing. In general, the results were promising – 25 of 25 known somatic mutations were identified in SMRT sequencing of PCR products, although 6 were detected at lower-than-expected prevalence.

    Somatic mutations from AML2 were also used to create mixed PCR libraries of various tumor cellularities from 50% to 100%. It was apparent that “tier 1″ somatic coding mutations were more reliably detected on Pac Bio than tier 2 and tier 3 mutations, and that there’s a slight bias against detecting C to T mutations. That said, the ability of SMRT sequencing to detect somatic mutations even at low tumor cellularities is promising.

    AddThis Social Bookmark Button

    AGBT: Focus on Cancer Genomics

    February 26th, 2010

    As usual, the quality of the scientific presentations at this meeting has been outstanding. The weather, too, has improved at last:

    p_00014

    There are too many to cover (or even attend) completely, but one area of interest with a strong focus this year is cancer genomics. Yesterday during plenary sessions, Stacey Gabriel of the Broad Institute of MIT and Harvard presented sequencing of multiple myeloma, a liquid tumor affecting 50,000 people in the US. Around 5,200 gigabases of sequence was generated across 26 tumor samples and matched controls, yielding ~30x average depth per genome. Their mutation detection pipeline achieved an admirable validation rate for somatic SNVs (95%). Short indels were more challenging (~50% validated), and candidate rearrangements even more so (30-50% validated). However, their study validated ~40 somatic mutations per tumor, implicating known MM genes (NRAS, KRAS, TP53) as well as novel ones (DIS3, FAM46C).

    Elliott Margulies on Melanoma

    Last night, there was a concurrent session devoted to cancer genomics. Eliott Margulies (NIH/NHGRI) led the lineup with his work sequencing the tumor genome and matched normal of a melanoma patient. Using the Illumina platform (2×100 bp), his group achieved 36x and 43x haploid coverage for tumor and normal, respectively, with ~99% of the genome covered by at least one read. Much of the talk was devoted to their analysis pipeline, summarized as:

    1. Initial alignment of Illumina reads with ELAND
    2. Partitioning the reads into “genome” bins of several kilobases
    3. Local realignment with cross_match in highly parallelized fashion
    4. SNV calling with their “Most Probable Genotype” (MPG) method
    5. Removal of variants with any evidence in the Germline, or ones in dbSNP

    The 175,768 novel tumor-specific SNVs were classified as coding (807) or noncoding (174,961). Some 513 of 807 coding variants were nonsynonymous. Of these, 101 were selected for validation; 84 got validation results and 75 somatic coding mutations (89%) were confirmed. Unsurprisingly, Dr. Margulies used his group’s expertise in comparative genomics to closely examine the noncoding variants as well. His group recently annotated “Chai” regions of the human genome, which bear evidence of evolutionary constraint that suggest functional relevance. Some 10,285 of the 174,961 fell within Chai regions, and among them were ~2,000 variants predicted to dramatically alter the local structure of DNA (suggesting regulatory changes).

    Sequencing Pre- and Post-Treatment Lung Cancer

    Ian Bosdet of BC Cancer Agency presented some very interesting work on mutational profiling of pre- and post-treatment lung cancer tumors. His group had the opportunity to participate in a clinical trial at BCCA in which carefully-selected, treatment-naive NSCLC patients underwent a standard therapeutic program. First, each patient underwent a pre-treatment evaluation and biopsy. Next, they received erlotinib (an EGFR inhibitor) until the disease inevitably progressed. Then, another biopsy that was sent for pathology review, as well as DNA/RNA extraction for sequencing. Transcriptome sequencing yielded some interesting findings. For example, the expression of one gene (IER5L or IER5C, it’s hard to read my own handwriting) was highly expressed in smokers that did not respond to treatment. A screen of unmapped transcript reads against viral genomes revealed the presence of Epstein-Barr Virus transcripts in one tumor that was later re-classified as EBV-positive lymphadenocarcinoma (?).

    Mutational profiling for three patients was obtained via exome capture (Agilent) and sequencing of normal, pre-treatment tumor, and post-treatment tumor samples. Somatic mutations in PHACTR2 were seen only in pre-treatment samples. Mutations in a few genes (PRMT10, RanBP2) were found at both times, but a few (YY1AP1, SNX9) were only present after treatment, suggesting a role for these genes in progressive disease.

    AddThis Social Bookmark Button

    AGBT 2010: First Impressions

    February 25th, 2010
    p_00010

    Only in Florida: Jellyfish Aquarium

    I’m in the midst of my first full day at Marco Island. More than any other meeting that I’ve attended, AGBT has a remarkable corporate presence. Life Technologies seems to be the biggest sponsor; you can’t look anywhere without seeing a banner that promotes the new SOLiD4 system. Apparently I’m doing a poor job at keeping up with SOLiD, as I’d only just heard about SOLiD3. I spoke to Richard Gibbs at a coffee break, and he mentioned that SOLiD4 is an upgrade, not a new machine. Must be nice.

    Caliper Life Sciences, a maker of microfluidics equipment for next-generation sequencing, won favor with many attendees by hanging chocolate “chips” (mini bars) on the doorknobs of every AGBT attendee’s room in the hotel to promote their recently-launched LabChip XT. I learned of this company only a week or so ago, when my colleague Vince Magrini was named to their scientific advisory board.

    PacBio Instrument Unveiled

    Pacific Biosciences unveiled their coveted SMRT sequencing instrument last night in a small, invitation-only event in their suite. Sadly, I wasn’t invited, but I’m told the guest list was very exclusive. Most likely it was restricted to directors from the ten initial PacBio customers that were announced last week. Tonight, PacBio hosts a roundtable called Global Challenges, Genomic Solutions that will be moderated by Charlie Rose.

    Other Players in the Field

    This morning at breakfast, Agilent Technologies was trading SureSelect T-shirts for surveys that assessed respondents’ interest in exome capture, which (thus far) seems to be the recurrent hot topic at AGBT. Things have been quiet from some of the other large sponsors, including Illumina, Complete Genomics, Roche, and others. I’m sure that their hour of glory will come soon enough.

    AddThis Social Bookmark Button