Skip to content

Finding genotyped accessions carrying a specific haplotype

New feature in v3.7.3

  • Added the ability to search a VCF file for accessions carrying a specific SNP or haplotype.
  • Try out this few feature now on agg.plantinformatics.io - sign up for a free account now to get started!

This section assumes some working knowledge of using the Genotype tab

Introduction

In this example, we will search AGG wheat genotypes (Triticum aestivum - IWGSC_RefSeq_v2.1 - Genotypes - AGG Filled-in Release 1) for accessions carrying the Yr34/Yr48 haplotype located at the end of Chr5A.

finalimage

For more information on how we found the accessions carrying Yr34/Yr48 where located at the end of Chr5A using Pretzel please see User Story 2.

Finding the VCF file

Start by adding the VCF file which contains all of the AGG wheat accessions that we will eventually search. This can be found using the search pannel on the left hand side of the screen and searching for agg wheat filled. This should help us find Triticum aestivum - IWGSC_RefSeq_v2.1 - Genotypes - AGG Filled-in Release 1.

Image

We are interested in the end of Chr5A, so open up dataset and load in Chr5A. Next, to want to navigate to the end of Chr5A. This can be done by clicking and dragging over the region of interest and pressing the zoom button that appears at the bottom. Then select the region again, making sure to cover up to the end of the chromosome. This is done to select the features we want to visualise. Next, open the Genotypes tab in the right panel, which displays the selected SNPs.

Image

Selecting the haplotype

To add the haplotype we want to search, click on the specific allele located in the REF and ALT columns. If you made a mistake, you can click on the same allele again to de-select it.

Example specific information

To search for Yr34/Yr48, we can use the pattern identified already in User Story 2. finalimage

Image

Filtering the accessions for the haplotype

Once the haplotype has been defined, open up the sample selection menu by clicking on the cog icon within the Genotypes tab and select the "Filter by defined haplotype" option. This will filter the accessions for only those that match the selected haplotype. This will then update the number of haplotypes and update the list of selectable accessions in the sample selection menu.

In this example, we see a change from AGG wheat 12,606 accessions (the full list of accessions in the file) to a filtered list of 333 accessions which match this exact haplotype.

Image

Viewing and using the results

There are a couple of ways to view the results; the easiest is to display this within Pretzel. By going down to the accessions selection box click on the first accession, hold Shift then scroll down to the bottom of the list to select the last accession.

This should select all 333 of the filtered accessions. Then click VCF Search to load the genotypes for these accessions into the genotype table.

Image

Image

Exporting the results (optional)

There are two main ways to export the results of the search. The first is to export the genotype table as a VCF file. This can be done by clicking the VCF Download button above the genotype table. Image

The other way to export is within the accession selection menu. This can be done by clicking the copy to clipboard button. This will copy the list of accessions to the clipboard in a list. Image