Short Match Track Settings
 
Perfect Match to Short Sequence   (All Mapping and Sequencing tracks)

Short (2-30 base) sequence:
Examples: TATAWAAR, AAAAA

Assembly: Human Dec. 2013 (GRCh38/hg38)

Description

This track shows all occurrences of a selected short motif within the displayed position range of the assembly sequence. It is useful for finding oligonucleotides, restriction sites, or other recurring short sequences within the assembly. In full display mode, each motif occurrence is labeled by the strand on which the match is located, followed by the starting coordinate of the match. In cases where the input motif sequence is identical to its reverse complement, only the match on the "+" strand is shown.
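The strand rule above can be illustrated with a small sketch. It is not the browser's own code: the function names `reverse_complement` and `strands_to_report` are hypothetical, and the sketch assumes the motif uses only the standard IUPAC DNA codes.

```python
# Complement of each IUPAC DNA code (for example, R = A/G pairs with Y = C/T).
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(motif):
    """Return the reverse complement of an IUPAC DNA motif."""
    return motif.upper().translate(COMPLEMENT)[::-1]


def strands_to_report(motif):
    """Return ["+"] when the motif equals its reverse complement,
    otherwise ["+", "-"], mirroring the rule described above."""
    motif = motif.upper()
    if reverse_complement(motif) == motif:
        return ["+"]
    return ["+", "-"]
```

For instance, `strands_to_report("CACGTG")` gives `["+"]` because the E-box is its own reverse complement, while `strands_to_report("AATAAA")` gives `["+", "-"]`.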

The track can be configured to search for any short sequence from 2 to 30 bases long. Sequences may include IUPAC ambiguity codes. To change the motif, open the track's description page (by clicking the track control label or the mini-button to the left of the track), then type a new sequence into the text box.
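As a rough illustration of how the length limit and IUPAC ambiguity codes could be handled, the sketch below validates a motif and scans a sequence string on the "+" strand. It is an assumption-laden example, not the track's implementation: the names `validate_motif`, `motif_to_regex`, and `find_plus_strand` are hypothetical, only the DNA IUPAC codes are accepted, positions are 0-based offsets into the given string, and overlapping occurrences are reported (the track's behavior on overlaps is not stated here).

```python
import re

# IUPAC nucleotide codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def validate_motif(motif):
    """Upper-case the motif and check the 2-30 base limit and the alphabet."""
    motif = motif.strip().upper()
    if not 2 <= len(motif) <= 30:
        raise ValueError("motif must be 2-30 bases long")
    bad = sorted(set(motif) - set(IUPAC))
    if bad:
        raise ValueError("not IUPAC nucleotide codes: " + "".join(bad))
    return motif


def motif_to_regex(motif):
    """Turn each ambiguity code into a character class, e.g. W -> [AT]."""
    parts = [b if len(IUPAC[b]) == 1 else "[" + IUPAC[b] + "]" for b in motif]
    # A lookahead keeps each match zero-width, so overlapping hits are found.
    return re.compile("(?=(" + "".join(parts) + "))")


def find_plus_strand(sequence, motif):
    """Return 0-based start offsets of every "+" strand occurrence."""
    pattern = motif_to_regex(validate_motif(motif))
    return [m.start() for m in pattern.finditer(sequence.upper())]
```

With this sketch, `find_plus_strand("GGTATAAAAGCC", "TATAWAAR")` returns `[2]`, and a one-base motif such as `"A"` raises `ValueError`.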

For instructions on creating a BED file of the short match data, see this mailing list question.
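For orientation only, the sketch below shows what six-column BED lines for a set of matches might look like. It is not the procedure from the mailing list answer: the function `matches_to_bed` and its arguments are hypothetical, matches are assumed to be `(strand, start)` pairs with 0-based starts, and the name column imitates the full-mode label (strand followed by a start coordinate, assumed here to be 1-based).

```python
def matches_to_bed(chrom, motif_len, matches):
    """Format (strand, 0-based start) pairs as tab-separated BED6 lines.

    BED uses 0-based, half-open intervals: chromStart is the first base
    of the match and chromEnd is chromStart + motif length.
    """
    lines = []
    for strand, start in sorted(matches, key=lambda m: (m[1], m[0])):
        end = start + motif_len
        name = f"{strand}{start + 1}"  # assumed 1-based label, e.g. "+1001"
        lines.append(f"{chrom}\t{start}\t{end}\t{name}\t0\t{strand}")
    return "\n".join(lines)
```

Calling `matches_to_bed("chr1", 6, [("+", 1000), ("-", 1020)])` yields two lines, the first being `chr1`, `1000`, `1006`, `+1001`, `0`, `+` separated by tabs.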

Example motifs

Category | Motif (Consensus) | Name / Function | Notes
Transcription initiation | TATAWAAR | TATA box | Classic Pol II promoter element (~30 bp upstream of TSS).
Transcription initiation | CCATNTT | YY1 binding motif | Common promoter-associated TF motif.
Transcription initiation | YYANWYY | Initiator (Inr) | Anchors transcription start when no TATA box is present.
Transcription termination | AATAAA | Polyadenylation signal (PAS) | Main poly(A) signal; variants include ATTAAA, TATAAA.
RNA modification | DRACH | m⁶A methylation motif | Core motif for METTL3/METTL14 deposition.
Splice donor | MAGGTRAGT | 5′ splice site (donor) | Exon–intron boundary is after MAG.
Splice acceptor | YYYYYYYYYNCAGG | 3′ splice site (acceptor) | Long pyrimidine tract + invariant AG; the splice site is before the last G.
Branch point | YNYURAY | Splicing branch point | Located upstream of the 3′ splice site; weak but conserved. Written in RNA notation (U corresponds to T in DNA).
Transcription factor | GATA | GATA family motif | Recognized by GATA1/2/3/4.
Transcription factor | CACGTG | E-box | Recognized by MYC/MAX and USF families.
Transcription factor | TGASTCA | AP-1 motif (Jun/Fos) | Key stress-response motif; S = G/C.
Transcription factor | CCGCCC | SP1 motif | Classic GC-rich promoter element.
Transcription factor | GCGTG | HIF1A/HRE variant | Hypoxia response element; canonical form is RCGTG.
Transcription factor | GATTA | Homeobox (HOX) core | Generic homeodomain preference; flanking bases refine specificity.
RNA editing | WAR (local context) | ADAR A-to-I editing preference | Less strict motif; enriched in dsRNA structures.
Replication origin | WAWTTDDWW | ORC-associated origin motif | Weak consensus; human origins have low sequence specificity.

Credits

This track was generated by Jim Kent of the UCSC Genome Bioinformatics Group.