Schema for Other ESTs - Non-Human ESTs from GenBank
  Database: hg38    Primary Table: xenoEst    Row Count: 49,226,383   Data last updated: 2020-08-17
Format description: Summary info about a patSpace alignment
fieldexampleSQL type description
bin 585smallint(5) unsigned Indexing field to speed chromosome range queries.
matches 41int(10) unsigned Number of bases that match that aren't repeats
misMatches 10int(10) unsigned Number of bases that don't match
repMatches 0int(10) unsigned Number of bases that match but are part of repeats
nCount 0int(10) unsigned Number of 'N' bases
qNumInsert 0int(10) unsigned Number of inserts in query
qBaseInsert 0int(10) unsigned Number of bases inserted in query
tNumInsert 0int(10) unsigned Number of inserts in target
tBaseInsert 0int(10) unsigned Number of bases inserted in target
strand -+char(2) + or - for strand. First character query, second target (optional)
qName AA542579varchar(255) Query sequence name
qSize 418int(10) unsigned Query sequence size
qStart 367int(10) unsigned Alignment start position in query
qEnd 418int(10) unsigned Alignment end position in query
tName chr1varchar(255) Target sequence name
tSize 248956422int(10) unsigned Target sequence size
tStart 13003int(10) unsigned Alignment start position in target
tEnd 13054int(10) unsigned Alignment end position in target
blockCount 1int(10) unsigned Number of blocks in alignment
blockSizes 51,longblob Size of each block
qStarts 0,longblob Start of each block in query.
tStarts 13003,longblob Start of each block in target.

Connected Tables and Joining Fields
        hg38.all_est.qName (via xenoEst.qName)
      hg38.all_mrna.qName (via xenoEst.qName)
      hg38.refGene.name (via xenoEst.qName)
      hg38.refSeqAli.qName (via xenoEst.qName)
      hg38.xenoMrna.qName (via xenoEst.qName)
      hg38.xenoRefGene.name (via xenoEst.qName)
      hg38.xenoRefSeqAli.qName (via xenoEst.qName)
      hgFixed.gbCdnaInfo.acc (via xenoEst.qName)
      hgFixed.gbMiscDiff.acc (via xenoEst.qName)
      hgFixed.gbSeq.acc (via xenoEst.qName)
      hgFixed.gbWarn.acc (via xenoEst.qName)
      hgFixed.imageClone.acc (via xenoEst.qName)

Sample Rows
 
binmatchesmisMatchesrepMatchesnCountqNumInsertqBaseInserttNumInserttBaseInsertstrandqNameqSizeqStartqEndtNametSizetStarttEndblockCountblockSizesqStartstStarts
5854110000000-+AA542579418367418chr12489564221300313054151,0,13003,
585332440014336778-+AA38872850788507chr124895642217625247794117,147,69,43,0,117,264,376,17625,17914,18267,24736,
7340361001434177304+-AA3887285070507chr124895642217625195393588,44,68,147,117,0,88,175,243,390,248761029,248761118,248938087,248938361,248938680,
5864110000000-+AA542579418367418chr1248956422183522183573151,0,183522,
586400580024446781-+AA3887285075507chr12489564221881491953886117,48,98,69,42,84,0,117,166,264,376,418,188149,188438,188486,188790,195261,195304,
58610244000000-+AA978407761584730chr12489564222586552588011146,31,258655,
5865411000000-+AA803699632528593chr1248956422258736258801165,39,258736,
7354400001234977++AA241090499380438chr1248956422447927682962220,38,380,400,447927,682924,
73865001693235036++AA414142414254414chr1248956422447927683054420,38,12,21,254,274,312,393,447927,682924,682963,683033,
73926001693235036++AA222500507310477chr1248956422447927683061420,38,12,28,310,330,368,449,447927,682924,682963,683033,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Other ESTs (xenoEst) Track Description
 

Description

This track displays translated blat alignments of expressed sequence tags (ESTs) in GenBank from organisms other than human. ESTs are single-read sequences, typically about 500 bases in length, that usually represent fragments of transcribed genes.

Display Conventions and Configuration

This track follows the display conventions for PSL alignment tracks. In dense display mode, the items that are more darkly shaded indicate matches of better quality.

The strand information (+/-) for this track is in two parts. The first + or - indicates the orientation of the query sequence whose translated protein produced the match. The second + or - indicates the orientation of the matching translated genomic sequence. Because the two orientations of a DNA sequence give different predicted protein sequences, there are four combinations. ++ is not the same as --, nor is +- the same as -+.

The description page for this track has a filter that can be used to change the display mode, alter the color, and include/exclude a subset of items within the track. This may be helpful when many items are shown in the track display, especially when only some are relevant to the current task.

To use the filter:

  1. Type a term in one or more of the text boxes to filter the EST display. For example, to apply the filter to all ESTs expressed in a specific organ, type the name of the organ in the tissue box. To view the list of valid terms for each text box, consult the table in the Table Browser that corresponds to the factor on which you wish to filter. For example, the "tissue" table contains all the types of tissues that can be entered into the tissue text box. Multiple terms may be entered at once, separated by a space. Wildcards may also be used in the filter.
  2. If filtering on more than one value, choose the desired combination logic. If "and" is selected, only ESTs that match all filter criteria will be highlighted. If "or" is selected, ESTs that match any one of the filter criteria will be highlighted.
  3. Choose the color or display characteristic that should be used to highlight or include/exclude the filtered items. If "exclude" is chosen, the browser will not display ESTs that match the filter criteria. If "include" is selected, the browser will display only those ESTs that match the filter criteria.

This track may also be configured to display base labeling, a feature that allows the user to display all bases in the aligning sequence or only those that differ from the genomic sequence. For more information about this option, go to the Base Coloring for Alignment Tracks page. Several types of alignment gap may also be colored; for more information, go to the Alignment Insertion/Deletion Display Options page.

Methods

To generate this track, the ESTs were aligned against the genome using blat. When a single EST aligned in multiple places, the alignment having the highest base identity was found. Only alignments having a base identity level within 0.5% of the best and at least 96% base identity with the genomic sequence were kept.

Credits

This track was produced at UCSC from EST sequence data submitted to the international public sequence databases by scientists worldwide.

References

Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2013 Jan;41(Database issue):D36-42. PMID: 23193287; PMC: PMC3531190

Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL. GenBank: update. Nucleic Acids Res. 2004 Jan 1;32(Database issue):D23-6. PMID: 14681350; PMC: PMC308779

Kent WJ. BLAT - the BLAST-like alignment tool. Genome Res. 2002 Apr;12(4):656-64. PMID: 11932250; PMC: PMC187518