Difference between revisions of "SEED Viewer Manual/Evidence"

From TheSeed
Jump to navigation Jump to search
Line 23: Line 23:
 
This section graphically lists evidence for similarities to other features in the SEED database (or also other databases). The '''E-Value Key''' shown on the top defines the colors that are used to display different E-Value ranges for the similarities to the hit features. Hovering over the E-Value Key shows the value range for each color.
 
This section graphically lists evidence for similarities to other features in the SEED database (or also other databases). The '''E-Value Key''' shown on the top defines the colors that are used to display different E-Value ranges for the similarities to the hit features. Hovering over the E-Value Key shows the value range for each color.
  
Each similarity is represented by two bars, showing the alignment of the similarity. The first bar is the ''query'' feature, the second the ''hit'' feature. The length of the outside box shows the complete length of the respective sequence. The color of the outside box represents the range of the evalue score according to the E-Value Key bar. The length of the inner (white) box depicts the actual section of the sequence the similarity to the other feature is in.
+
Each similarity is represented by two bars, showing the alignment of the similarity. The first bar is the ''query'' feature, the second the ''hit'' feature. The abbreviation in front of this bar informs you about the organism the hit feature is in. Hover over the abbreviation to get the complete organism name. Behind the box you can find the function of the hit feature.
 +
 
 +
The length of the outside box shows the complete length of the respective sequence. The color of the outside box represents the range of the evalue score according to the E-Value Key bar. The length of the inner (white) box depicts the actual section of the sequence the similarity to the other feature is in.
  
 
[[Image:EvidenceSims1.png]]
 
[[Image:EvidenceSims1.png]]

Revision as of 06:05, 25 November 2008

The Evidence Page is divided into two parts via a TabView: The Visual Protein Evidence and the Tabular Protein Evidence.

Visual Protein Evidence

After loading the Evidence Page, the first tab of the TabView is selected. It visually shows different pre-computed tool results for the given feature. In this view, you can see evidence for Location of the product of the gene in the cell, evidence for protein Domains and evidence that show Similarities to other features.

Location

Location stand for location of the product of the feature in the cell. This section presents output for tools that look for transmembrane helices (TM) or signal peptides (SP) in the feature. In the example, you can see five transmembrane helices in the protein identified via the Phobius tool. They are visualized as little boxes, and their location on the line depicts the location of the transmembrane helices in the protein.

EvidenceLocation.png

Domains

This section shows pre-computed domains for the selected feature. In the example, you can find a CDD domain and a Pfam domain for the feature. The blue bar marks the location of the domain found in the protein (the line depicts the whole length of the protein).

Additional tools can be accessed via the Feature Tools Menu in the menu bar.

EvidenceDomain.png

Similarities

This section graphically lists evidence for similarities to other features in the SEED database (or also other databases). The E-Value Key shown on the top defines the colors that are used to display different E-Value ranges for the similarities to the hit features. Hovering over the E-Value Key shows the value range for each color.

Each similarity is represented by two bars, showing the alignment of the similarity. The first bar is the query feature, the second the hit feature. The abbreviation in front of this bar informs you about the organism the hit feature is in. Hover over the abbreviation to get the complete organism name. Behind the box you can find the function of the hit feature.

The length of the outside box shows the complete length of the respective sequence. The color of the outside box represents the range of the evalue score according to the E-Value Key bar. The length of the inner (white) box depicts the actual section of the sequence the similarity to the other feature is in.

EvidenceSims1.png

EvidenceHoverSim.png

EvidenceFil1.png

Tabular Protein Evidence

Similarities

EvidenceFilter.png

Domains

This section shows pre-computed domains for the selected feature. In the example, you can find a CDD domain and a Pfam domain for the feature. The blue bar marks the location of the domain found in the protein (the line depicts the whole length of the protein).

The table lists the Domain DB (the database for the domain that was hit), the ID in the domain database, the Name of the domain, the Location of the hit in the selected feature, the Score for the hit against the domain, as well as the Function of the domain.

The table can be exported using the export table button.

Additional tools can be accessed via the Feature Tools Menu in the menu bar.

EvidenceDomTable.png

Identical Proteins

Essentially Identical Proteins are proteins that share a common sequence, but the start position of the proteins may vary a little. This definition was made because in different databases or close strains of organisms, it often happens that a protein is present, but the start position may be shifted in the finding genes step. So essentially, this table shows aliases of the feature that were based on protein identity.

The first column of the table shows the Database the alias can be found in, while the second column (ID) offers the alias name and a link to the protein in the respective database. The following two columns describe the Organism and the Assignment for the feature for the alias.

EvidenceEIPs.png

Functionally Coupled

This table lists all functionally coupled genes in the organism. You can see the Score, the ID of the feature and the Function of the feature.

EvidenceFCs.png