hg38 CCDS Gene

Consensus CDS Gene CCDS82447.1

Gene	NRXN1
Description	neurexin-1 isoform beta2
Sequences	CDS, protein, genomic
CCDS database	CCDS82447.1

Associated Sequences

	mRNA	Protein
UCSC Genes	ENST00000628364.2
RefSeq	NM_001330091.1	NP_001317020.1
RefSeq	XM_017005336.1	XP_016860825.1
Vega	OTTHUMT00000481154	OTTHUMP00000278424
Ensembl	ENST00000628364	ENSP00000485815

Note: mRNA and protein sequences in other gene collections may differ from the CCDS sequences.

RefSeq summary of CCDS82447.1

This gene encodes a single-pass type I membrane protein that belongs to the neurexin family. Neurexins are cell-surface receptors that bind neuroligins to form Ca(2+)-dependent neurexin/neuroligin complexes at synapses in the central nervous system. This complex is required for efficient neurotransmission and is involved in the formation of synaptic contacts. Three members of this gene family have been studied in detail and are estimated to generate over 3,000 variants through the use of two alternative promoters (alpha and beta) and extensive alternative splicing in each family member. Recently, a third promoter (gamma) was identified for this gene in the 3' region. Mutations in this gene are associated with Pitt-Hopkins-like syndrome-2 and may contribute to susceptibility to schizophrenia. [provided by RefSeq, Aug 2016]

Sequence Note: The RefSeq transcript and protein were derived from genomic sequence to make the sequence consistent with the reference genome assembly. The genomic coordinates used for the transcript record were based on alignments.

Data last updated at UCSC: 2019-10-03

Description

This track shows human genome high-confidence gene annotations from the Consensus Coding Sequence (CCDS) project. This project is a collaborative effort to identify a core set of human protein-coding regions that are consistently annotated and of high quality. The long-term goal is to support convergence towards a standard set of gene annotations on the human genome.

Collaborators include:

European Bioinformatics Institute (EBI)
National Center for Biotechnology Information (NCBI)
University of California, Santa Cruz (UCSC)
Wellcome Trust Sanger Institute (WTSI)

For more information on the different gene tracks, see our Genes FAQ.

Methods

CDS annotations of the human genome were obtained from two sources: NCBI RefSeq and a union of the gene annotations from Ensembl and Vega, collectively known as Hinxton.

Genes with identical CDS genomic coordinates in both sets become CCDS candidates. The genes undergo a quality evaluation, which must be approved by all collaborators. The following criteria are currently used to assess each gene:

an initiating ATG (Exception: a non-ATG translation start codon is annotated if it has sufficient experimental support), a valid stop codon, and no in-frame stop codons (Exception: selenoproteins, which contain a TGA codon that is known to be translated to a selenocysteine instead of functioning as a stop codon)
ability to be translated from the genome reference sequence without frameshifts
recognizable splicing sites
no intersection with putative pseudogene predictions
supporting transcripts and protein homology
conservation evidence with other species
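
As an illustration only, the sequence-level criteria in the list above (start codon, stop codon, in-frame stops, and a reading frame that stays intact) can be sketched in a few lines of Python. This is not the CCDS project's evaluation code. The function name `check_cds` and its parameters are hypothetical, the length check is only a rough proxy for "no frameshifts", and the remaining criteria (splice sites, pseudogene overlap, transcript and homology support, conservation) need external data and are not covered.

```python
# Hypothetical sketch of the sequence-level checks; not the CCDS pipeline.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds, allow_non_atg_start=False, selenocysteine_codons=frozenset()):
    """Return a list of problems found in a spliced CDS string.

    cds                   -- coding sequence, start codon through stop codon
    allow_non_atg_start   -- set True for an experimentally supported
                             non-ATG start (the exception in the list)
    selenocysteine_codons -- 0-based codon indices where an in-frame stop
                             codon is accepted (assumed here to be TGA
                             read as selenocysteine)
    """
    cds = cds.upper()
    problems = []

    # A CDS whose length is not a multiple of 3 cannot be read in one frame.
    if len(cds) % 3 != 0:
        problems.append("length not a multiple of 3")

    # Split into complete codons, ignoring any trailing partial codon.
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    if not codons:
        return problems + ["no complete codon"]

    if codons[0] != "ATG" and not allow_non_atg_start:
        problems.append("no initiating ATG")

    if codons[-1] not in STOP_CODONS:
        problems.append("no valid stop codon")

    # Every codon before the last one must be a sense codon, unless it is
    # explicitly listed as a selenocysteine position.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS and index not in selenocysteine_codons:
            problems.append(f"in-frame stop codon at codon {index}")

    return problems


if __name__ == "__main__":
    # Toy sequences, not real gene data.
    print(check_cds("ATGAAATAA"))
    print(check_cds("ATGTGAAAATAA"))
    print(check_cds("ATGTGAAAATAA", selenocysteine_codons={1}))
```

An empty list means the toy sequence passes these basic checks. Returning every problem found, rather than stopping at the first, makes it easier to see why a candidate would be rejected.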

Each CCDS is assigned a unique CCDS ID, which links together all gene annotations that share the same CDS. CCDS gene annotations are under continuous review, with periodic updates to this track.
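
The matching step described in Methods, where annotations with identical CDS genomic coordinates in both source sets become candidates, can be pictured as grouping annotations by a coordinate key. The Python sketch below is a simplified illustration under stated assumptions, not the project's implementation. The record layout (`id`, `chrom`, `strand`, `cds_exons`), the function names, and the toy coordinates are all hypothetical.

```python
# Hypothetical sketch of candidate matching by identical CDS coordinates.
from collections import defaultdict


def cds_key(annotation):
    """Build a hashable key from chromosome, strand, and CDS exon intervals."""
    return (
        annotation["chrom"],
        annotation["strand"],
        tuple(tuple(exon) for exon in annotation["cds_exons"]),
    )


def find_candidates(refseq, hinxton):
    """Return CDS keys present in both sets, with the IDs that share each key."""
    by_key = defaultdict(lambda: {"refseq": [], "hinxton": []})
    for annotation in refseq:
        by_key[cds_key(annotation)]["refseq"].append(annotation["id"])
    for annotation in hinxton:
        by_key[cds_key(annotation)]["hinxton"].append(annotation["id"])
    # Keep only CDS structures annotated identically by both sources.
    return {
        key: ids
        for key, ids in by_key.items()
        if ids["refseq"] and ids["hinxton"]
    }


if __name__ == "__main__":
    # Toy records with invented coordinates and placeholder IDs.
    refseq = [
        {"id": "R1", "chrom": "chr1", "strand": "+",
         "cds_exons": [(100, 200), (300, 400)]},
        {"id": "R2", "chrom": "chr1", "strand": "+",
         "cds_exons": [(100, 200), (300, 450)]},
    ]
    hinxton = [
        {"id": "H1", "chrom": "chr1", "strand": "+",
         "cds_exons": [(100, 200), (300, 400)]},
    ]
    for key, ids in find_candidates(refseq, hinxton).items():
        print(key, ids)
```

Each surviving key stands for one CDS structure, so a single identifier attached to it would tie together every annotation listed under that key, which is the role the CCDS ID plays in the track.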

Credits

This track was produced at UCSC from data downloaded from the CCDS project web site.

References

Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T et al. The Ensembl genome database project. Nucleic Acids Res. 2002 Jan 1;30(1):38-41. PMID: 11752248; PMC: PMC99161

Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, Maglott DR, Searle S, Farrell CM, Loveland JE, Ruef BJ et al. The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes. Genome Res. 2009 Jul;19(7):1316-23. PMID: 19498102; PMC: PMC2704439

Pruitt KD, Tatusova T, Maglott DR. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4. PMID: 15608248; PMC: PMC539979