This track shows all occurrences of a selected short motif within the displayed position range of
the assembly sequence. It is useful for finding oligonucleotides, restriction sites, or other
recurring short sequences within the assembly. In full display mode, each motif occurrence is
labeled by the strand on which the match is located, followed by the starting coordinate of the
match. In cases where the input motif sequence is identical to its reverse complement, only the
match on the "+" strand is shown.
The track may be configured to search for any short sequence of 2 - 30 bases in length. Sequences
may include IUPAC ambiguity codes. To change the motif,
open the track's description page (by clicking the track control label or the mini-button to the
left of the track), then type a new sequence into the text box.
To see how to create a bed file of the short match data see this mailing list question
here.
| Category |
Motif (Consensus) |
Name / Function |
Notes |
| Transcription initiation |
TATAWAAR |
TATA box |
Classic Pol II promoter element (~30 bp upstream of TSS). |
| Transcription initiation |
CCATNTT |
YY1 binding motif |
Common promoter-associated TF motif. |
| Transcription initiation |
YYANWYY |
Initiator (Inr) |
Anchors transcription start when no TATA box is present. |
| Transcription termination |
AATAAA |
Polyadenylation signal (PAS) |
Main poly(A) signal; variants include ATTAAA, TATAAA. |
| RNA modification |
DRACH |
m⁶A methylation motif |
Core motif for METTL3/METTL14 deposition. |
| Splice donor |
MAGGTRAGT |
5′ splice site (donor) |
exon–intron boundary is after MAG. |
| Splice acceptor |
YYYYYYYYYNCAGG |
3′ splice site (acceptor), splice site is before last G |
Long pyrimidine tract + invariant AG. |
| Branch point |
YNYURAY |
Splicing branch point |
Located upstream of the 3′ splice site; weak but conserved. |
| Transcription factor |
GATA |
GATA family motif |
Recognized by GATA1/2/3/4. |
| Transcription factor |
CACGTG |
E-box |
Recognized by MYC/MAX and USF families. |
| Transcription factor |
TGASTCA |
AP-1 motif (Jun/Fos) |
Key stress-response motif; S = G/C. |
| Transcription factor |
CCGCCC |
SP1 motif |
Classic GC-rich promoter element. |
| Transcription factor |
GCGTG |
HIF1A/HRE variant |
Hypoxia response element; canonical form is RCGTG. |
| Transcription factor |
GATTA |
Homeobox (HOX) core |
Generic homeodomain preference; flanking bases refine specificity. |
| RNA editing |
WAR (local context) |
ADAR A-to-I editing preference |
Less strict motif; enriched in dsRNA structures. |
| Replication origin |
WAWTTDDWW |
ORC-associated origin motif |
Weak consensus; human origins have low sequence specificity. |