Schema for UW DNaseI HS - DNaseI Hypersensitivity by Digital DNaseI from ENCODE/University of Washington
  Database: mm9    Primary Table: wgEncodeUwDnaseHlbudCd1ME11halfPkRep1    Row Count: 157,008   Data last updated: 2011-06-29
Format description: BED6+4 Peaks of signal enrichment based on pooled, normalized (interpreted) data.
On download server: MariaDB table dump directory
fieldexampleSQL type info description
bin 610smallint(5) unsigned range Indexing field to speed chromosome range queries.
chrom chr1varchar(255) values Reference sequence chromosome or scaffold
chromStart 3390100int(10) unsigned range Start position in chromosome
chromEnd 3390250int(10) unsigned range End position in chromosome
name .varchar(255) values Name given to a region (preferably unique). Use . if no name is assigned
score 101int(10) unsigned range Indicates how dark the peak will be displayed in the browser (0-1000)
strand .char(2) values + or - or . for unknown
signalValue 15float range Measurement of average enrichment for the region
pValue 10.4135float range Statistical significance of signal value (-log10). Set to -1 if not used.
qValue -1float range Statistical significance with multiple-test correction applied (FDR -log10). Set to -1 if not used.
peak -1int(11) range Point-source called for this peak; 0-based offset from chromStart. Set to -1 if no point-source called.

Sample Rows
 
binchromchromStartchromEndnamescorestrandsignalValuepValueqValuepeak
610chr133901003390250.101.1510.4135-1-1
612chr135852803585430.101.135.43209-1-1
612chr136376603637810.102.2416.2976-1-1
612chr136391603639310.101.1810.4135-1-1
612chr136414203641570.102.2226.9001-1-1
612chr136608603661010.117.189284.034-1-1
612chr136617603661910.105.5966.6548-1-1
612chr136626003662750.101.113.65553-1-1
615chr139918003991950.101.1713.6808-1-1
617chr143226804322830.102.1911.2048-1-1

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

UW DNaseI HS (wgEncodeUwDnase) Track Description
 

Description

This track was produced as part of the mouse ENCODE Project. It shows DNaseI sensitivity measured genome-wide in mouse tissues and cell lines using the Digital DNaseI methodology (see below), and DNaseI hypersensitive sites. DNaseI has long been used to map general chromatin accessibility and DNaseI hypersensitivity is a universal feature of active cis-regulatory sequences. The use of this method has led to the discovery of functional regulatory elements that include enhancers, insulators, promoters, locus control regions and novel elements. For each experiment (tissue/cell type), this track shows DNaseI sensitivity as a continuous function using sequencing tag density (Signal), and discrete loci of DNaseI sensitive zones (HotSpots) and hypersensitive sites (Peaks).

Display Conventions and Configuration

This track is a multi-view composite track that contains multiple data types (views). For each view, there are multiple subtracks that display individually on the browser. Instructions for configuring multi-view tracks are here. This track contains the following views:

HotSpots
DNaseI sensitive zones identified using the HotSpot algorithm.
Peaks
DNaseI hypersensitive sites (DHSs) identified as signal peaks within FDR 1.0% hypersensitive zones.
Signal
The density of tags mapping within a 150 bp sliding window (at a 20 bp step across the genome).

DNaseI sensitivity is shown as the absolute density of in vivo cleavage sites across the genome mapped using the Digital DNaseI methodology (see below). Data have been normalized to 25 million reads per cell type.

Metadata for a particular subtrack can be found by clicking the down arrow in the list of subtracks.

Methods

Cells were grown according to the approved ENCODE cell culture protocols. Fresh tissues were harvested from mice and the nuclei prepared according to the tissue-appropriate protocol. Digital DNaseI was performed by DNaseI digestion of intact nuclei, isolating DNaseI 'double-hit' fragments as described in Sabo et al. (2006), and direct sequencing of fragment ends (which correspond to in vivo DNaseI cleavage sites) using the Illumina IIx (and Illumina HiSeq by early 2011) platform (36 bp reads). Uniquely-mapping high-quality reads were mapped to the genome using Bowtie. DNaseI sensitivity is directly reflected in raw tag density, which is shown in the track as density of tags mapping within a 150 bp sliding window (at a 20 bp step across the genome). DNaseI sensitive zones (HotSpots) were identified using the HotSpot algorithm described in Sabo et al. (2004). False discovery rate thresholds of 1.0% (FDR 1.0%) were computed for each cell type by applying the HotSpot algorithm to an equivalent number of random uniquely-mapping 36mers. DNaseI hypersensitive sites (DHSs or Peaks) were identified as signal peaks within FDR 1.0% hypersensitive zones using a peak-finding algorithm (I-max).

Verification

Data were verified by sequencing biological replicates displaying correlation coefficient > 0.9.

Release Notes

This is Release 2 (September 2012) of this track. It adds 32 new experiments including 22 new cell lines and 4 new treatments.

Credits

These data were generated by the UW ENCODE group.

Contact: Richard Sandstrom

References

John S, Sabo PJ, Thurman RE, Sung MH, Biddie SC, Johnson TA, Hager GL, Stamatoyannopoulos JA. Chromatin accessibility pre-determines glucocorticoid receptor binding patterns. Nat Genet. 2011 Mar;43(3):264-8.

Sabo PJ, Hawrylycz M, Wallace JC, Humbert R, Yu M, Shafer A, Kawamoto J, Hall R, Mack J, Dorschner MO et al. Discovery of functional noncoding elements by digital analysis of chromatin structure. Proc Natl Acad Sci U S A. 2004 Nov 30;101(48):16837-42.

Sabo PJ, Kuehn MS, Thurman R, Johnson BE, Johnson EM, Cao H, Yu M, Rosenzweig E, Goldy J, Haydock A et al. Genome-scale mapping of DNase I sensitivity in vivo using tiling DNA microarrays. Nat Methods. 2006 Jul;3(7):511-8.

Data Release Policy

Data users may freely use ENCODE data, but may not, without prior consent, submit publications that use an unpublished ENCODE dataset until nine months following the release of the dataset. This date is listed in the Restricted Until column, above. The full data release policy for ENCODE is available here.