Deyou Zheng
PI: Deyou Zheng, Ph.D.
Office: Price 320
Phone: 718-678-1217
Email: deyou.zheng
At einsteinmed.org

Welcome to Zheng Lab

Our lab is in the Departments of Genetics and Neurology at Albert Einstein College of Medicine. We are located in the Michael F. Price Center for Genetics and Translational Medicine, and affiliated with the Rose F. Kennedy Center for Intellectual and Developmental Disabilities Research, and the Department of Neuroscience. The research areas of the lab are computational genomics and bioinformatics, with a strong focus on mining and interpreting large-scale genomic data (ie, bigdata science). The biological themes of our research are centered on the genetic and epigenetic regulations of early development, including neural development and and heart development, and their dysregulation in diseases. We also apply our bioinformatics framework and expertise to study cancer development and treatment. We are always motivated by new challenges in innovative bioinformatics research. *Left is a word cloud summary of the abstracts in our recent publications.


Our Research

Decoding the human genome and other mammalian genomes is our primary research interest, and comparative genomics and integrated analysis of functional genomics data are our major research approaches. In general, we are interested in developing and applying computational methods for exploiting patterns in high-throughput genomic and epigenomic data and for extracting information from experimental and computational bigdata in order to understand the structure, function and evolution of the human genome. We are highly interested in studying the expression, regulation, and evolution of human cardiac and nervous systems genes (coding or noncoding), especially those implicated in neuropsychiatric disorders and congenital heart diseases, using big experimental data from next-generation sequencing (e.g., ChIP-seq, ATAC-seq, BS-seq and RNA-seq from both bulk tissues and single cells). One fundamental question of our research is how a single human or mouse genome is used for establishing and maintaining distinct cell lineages and how this important process is perturbed during disease development. Our research group focuses on multiple areas.




Areas

iPSC system for studying the genetic basis of psychiatric disorders: to use iPSC technology and systems genomics approaches for studying neural development and abnormal gene regulation in neuropsychiatric disorders like schizophrenia and autism spectrum disorders.

In collaboration with experimentalist experts, we grow human neurons in dish by induced pluripotent stem cell (iPSC) technology in order to model human neuronal development and differentiation. We begin by developing iPSC lines from both patients and healthy subjects, differentiate them to neural progenitors and neurons, then use RNA-seq and other deep sequencing technology to identify differentially regulate genes by comparing the transcriptomes between patient-derived neurons and controls. Using this systems biology approach, we have identified many novel long non-coding RNA genes that are involved in embryonic neurogenesis and potentially neuropsychiatric disorders. We also find that many genes show allele-biased gene expression in different brain regions, including some that have been implicated in major psychiatric disorders, which may help explain some aspects of parent-of-origin effects, twin discordance and reduced penetrance. Our studies have also continously to uncover molecular pathways that are affected by critical genes that are major risk factors for schizophrenia and autisms.

Integrated analysis of functional genomics data: to explore computational techniques for combining experimental and computational genomics data in order to achieve a global understanding of the function of the human genome.

We are interested in developing bioinformatics algorithms to mine big genomics data and to conduct cross-genome sequence comparisons (e.g., syntenic assignment) with a focus on deciphering the gene regultory networks underlying normal development and development disorders. As large amount of functional genomic data from diverse sources (microarray, high-throughput sequencing, protein-protein interaction, etc.) are combined and used in our studies, our group develop and apply effective computational methods for data integration and at the same time for addressing common concerns of data quality in genome-scale experiments. A rigorous statistical framework needs to be built in order to extract biologically meaningful signals from noises or stochastic background.

Gene duplication: to investigate how gene duplication has shaped the human genome and how novel genes or regulatory elements emerge in humans and other primates.

Gene expansion (through either duplication or retrotransposition) is a major driving force for the emergence of novel functions during evolution. Inter- and intra-genome sequence comparison can reveal DNA elements that are either uniquely present or specifically selected in certain species. Tracking the evolutionary history of such sequences can lead to the discovery of genes or regulatory elements (including non-coding RNAs) that function specifically in humans. Thus, the long-term goal is to search for functional genomic components that set us apart from other animals. To this end, our research will compare the human genome with other mammalian (e.g., chimp, macaque, dog, and mouse) genomes to exploit the role of gene duplication in generating novel protein-protein interactions and novel biochemical pathways. We are interested in both the birth and the death (pseudogenization or loss) of such lineage-specific functional DNA sequences, especially for those involved in the development of nervous system and brains.

Develop software for improving data quality and addressing new bioinformatics challenges: to investigate how new data analysis algorithms can remove experimental noises and artifacts to increase data and result reproducibbility in big data analysis.

We are particularly interested in the generation of isoforms in different tissues / cells and how these special isoforms carry out their tissue-specific functions.
People

PI: Deyou Zheng, Professor.

Education
  • YALE UNIVERSITY, New Haven, CT. 2003-2007, Postdoctoral Fellow, Department of Molecular Biophysics and Biochemistry. Research Area: Bioinformatics & Human Genomics.
  • RUTGERS UNIVERSITY, Piscataway, NJ. Ph.D. in Biochemistry, 2003. Rsearch Area: Structural Genomics & Bioinformatics.
  • AUBURN UNIVERSITY, Auburn, AL. M.S. in Pathobiology, 1998. Research Area: Molecular Biology.
  • BEIJING AGRICULTURAL UNIVERSITY, Beijing, China. B.S. in Microbiology, 1992

Alex Ferrena, Graduate Student.

Phillip Galbo, Graduate Student.

Yang Liu, Postdoctoral Fellow.

Rohan Misra, Postdoctoral Fellow.

Maider Astorkia, Postdoctoral Fellow.

Positions

We are looking for enthusiastic undergraduate students, graduate students, and postdoctoral associates to join our group. Reasonable computer programming skills is a plus. Candidates are expected to work in a wide range of bioinformatics projects. Please contact the PI, Deyou Zheng, for details.


Selected Publications (*co-corresponding authors)

1. Liu Y, Wang T, Zhou B, Zheng D. (2021) Robust integration of multiple single-cell RNA sequencing datasets using a single reference space. Nat Biotechnol. 39: 877-884
2. Galbo PM, Zang X*, Zheng D*. (2021) Molecular features of cancer-associated fibroblast subtypes and their implication on cancer pathogenesis, prognosis, and immunotherapy resistance. Clin Cancer Res. 27:2636-2647
3. Liu Y#, Singh VK#, Zheng D. (2020). Stereo3D: using stereo images to enrich 3D visualization. Bioinformatics 36:4189-4190
4. Liu Y, Lu P, Wang Y, Morrow BE, Zhou B, Zheng D. (2019) Spatiotemporal Gene Coexpression and Regulation in Mouse Cardiomyocytes of Early Cardiac Morphogenesis. J Am Heart Assoc. 8(15):e012941.
5. Wang P, Zhao D, Lachman HM, Zheng D. (2018). Enriched expression of genes associated with autism spectrum disorders in human inhibitory neurons. Transl Psychiatry 8:13. (PMCID: PMC5802446)
6. Zhao D, Zheng D. (2018). SMARTcleaner: identify and clean off-target signals in SMART ChIP-seq analysis. BMC Bioinformatics. 19:544. (PMCID: PMC6307164)
7. Zhang D, Wu B, Wang P, Wang Y, Lu P, Nechiporuk T, Floss T, Greally JM, Zheng D*, Zhou B*. (2017). Non-CpG methylation by DNMT3B facilitates REST binding and gene silencing in developing mouse hearts. Nucleic Acids Res. 45: 3102-3115. (PMCID: PMC5389556)
8. Wang P#, Lin M#, Pedrosa E, Hrabovsky A, Zhang Z, Guo W, Lachman HM*, Zheng D*. (2015). CRISPR/Cas9-mediated heterozygous knockout of the autism gene CHD8 and characterization of its transcriptional networks in neurodevelopment. Molecular Autism 6:55.
9. Adam RC, Yang H, Rockowitz S, Larsen SB, Nikolova M, Oristian DS, Polak L, Kadaja M, Asare A, Zheng D, Fuchs E. (2015). Pioneer factors govern super-enhancer dynamics in stem cell plasticity and lineage choice. Nature 521:366-370
10. Rockowitz S, Lien WH, Pedrosa E, Wei G, Lin M, Zhao K, Lachman HM, Fuchs E, Zheng D. (2014). Comparison of REST Cistromes across Human Cell Types Reveals Common and Context-Specific Functions. PLoS Comput Biol 10: e1003671
11. Goldberg AD, Banaszynski LA, Noh KM, Lewis PW, Elsaesser SJ, Stadler S, Dewell S, Law M, Guo X, Li X, Wen D, Chapgier A, DeKelver RC, Miller JC, Lee YL, Boydston EA, Holmes MC, Gregory PD, Greally JM, Rafii S, Yang C, Scambler PJ, Garrick D, Gibbons R, Higgs DR, Cristea IM, Urnov FD, Zheng D*, Allis CD* (2010). Distinct Factors Control Histone Variant H3.3 Localization at Specific Genomic Regions. Cell 140:678-691.
12. Guo X, Zhang Z, Gerstein M, Zheng D. (2009) Small RNAs Originated from Pseudogenes: cis- or trans-Acting? PLoS Comput Biol 5:e1000449.
13. Zheng D, Gerstein M. (2007) The Ambiguous Boundary between Genes and Pseudogenes: the dead rise up, or do they? Trends Genet 23: 219-24.

Website designed by Andy Xu.