An interactive, static explorer is available in docs/ and can be published via GitHub Pages. It visualizes UMAP/t‑SNE embeddings from Cell Ranger analysis and colors cells by clusters.
The explorer automatically references these paths for each dataset:
outs/analysis/umap/2_components/projection.csvouts/analysis/tsne/2_components/projection.csvouts/analysis/clustering/graphclust/clusters.csv
Datasets currently included:
oyster_r1and2_CP3_roslin-mito-CRv3oyster_r1and2_CP2_roslin-mito-CRv3oyster_r1and2_CP1_roslin-mito-CRv3oyster_r1and2_Bla_roslin-mito-CRv3oyster_E4_redo2_roslin-mitooyster_E3_redo2_roslin-mitooyster_E2_redo2_roslin-mitooyster_E1_redo2_roslin-mito
Expression layers are not bundled for page weight reasons. To add expression:
- Export per‑cell expression for a set of genes into CSV files with headers
Barcode,<GENE>. - Place them under
docs/data/<dataset>/<gene>.csvand extendloadCoordinates/loadClusterswith aloadExpressionfunction. - Wire the
Showbutton indocs/index.htmlto overlay expression values as a color scale.
Repository associated with manuscript:
Primordial Germ Cell Specification and Early Developmental Cell States in Pacific Oyster
Mackenzie R. Gavery, Lauren Vandepas, Lauren M. Saunders, Brent Vadopalas, J. Adam Luckenbach, Cole Trapnell, Steven Roberts. Primordial germ cell specification and early developmental cell states in Pacific oyster. BMC Genomics 26, 951 (2025). https://doi.org/10.1186/s12864-025-12122-7
This repository contains single-cell RNA sequencing (scRNA-seq) analysis data and scripts for studying primordial germ cell specification and early developmental cell states in Pacific oyster (Crassostrea gigas).
Contains C. gigas gene annotation files used for Monocle3 analysis:
- Gene annotation tables with LOC identifiers
- Homology tables linking to other species (e.g., Strongylocentrotus purpuratus)
- R script for joining annotation files (
annotations_joining_SRhomologtable.R) - NCBI gene table downloads
Key files:
LOCannot_homologies.txt- Gene homology annotationsGCF_902806645.1_genetable_downloaded_011924.tsv- NCBI gene tableCg_hits-red_SpurSupp.tab- Cross-species gene comparisons
Output files from 10x Genomics CellRanger analysis (BAM files excluded to save space):
- Gastrula stage samples:
oyster_E1_redo2_roslin-mitothroughoyster_E4_redo2_roslin-mito - Cleavage and blastula samples:
oyster_r1and2_Bla_roslin-mito-CRv3(Blastula)oyster_r1and2_CP1_roslin-mito-CRv3throughoyster_r1and2_CP3_roslin-mito-CRv3(Cleavage period)
These directories contain filtered/raw feature-barcode matrices and analysis outputs used as input for Monocle3 analysis.
Monocle3 analysis scripts and RDS (R data serialization) files:
Main analysis scripts:
monocle_gastrula_manuscript.R- Gastrula stage analysismonocle_CP_manuscript.R- Cleavage period analysismonocle_bla_manuscript.R- Blastula stage analysismonocle_CPbla_manuscript.R- Combined cleavage and blastula analysismonocle_figure_April2025.R- Figure generation scriptmonocle_subcluster_analysis.R- Subcluster analysisMarkerGenes_markerscore.R- Marker gene scoring analysisscRNA_dotplot_clustered.R- Dot plot visualizationbarplot_semiquant.R- Semi-quantitative bar plots
RDS data objects:
oyster_CP_only.RDS- Cleavage period cell datasetoyster_blastula_only.RDS- Blastula stage cell datasetoyster_cleavage_blastula.RDS- Combined dataset
Subdirectories:
RDSgeneration_scripts_andRDSfiles/- Scripts for generating RDS objectsMarkerScoreAnalysis/- Marker gene analysis results (top markers by score for different stages)
CellRanger job scripts and genome reference files:
Job scripts:
count_CellRanger_E1_redo2.jobthroughcount_CellRanger_E4_redo2.job- Gastrula samplescount_CellRanger_Bla_Roslin-mito.job- Blastula samplecount_CellRanger_CP1_Roslin-mito.jobthroughcount_CellRanger_CP3_Roslin-mito.job- Cleavage period samplesCellRanger_mkref_Roslin.job- Reference genome creation
Subdirectory:
Cgigas_Roslin_mkref_genome-CRv3/- CellRanger genome reference (STAR index and GTF files)
Note: Large genome/reference files (.fa, .gtf, STAR index files) are excluded via .gitignore
Sequence Read Archive (SRA) submission metadata files:
- Sample metadata spreadsheets
- FASTQ file metadata and quality checks
- Multiple versions tracking submission revisions
Note: These files document data submission but are not required for reproducing the analysis.
Supplementary materials for the manuscript:
supplementary_figures_singlecell_oyster.docx- Supplementary figuressupplementary_files_singlecell_oyster.xlsx- Supplementary data tables
pgc-edc-oyster.Rproj- RStudio project file.gitignore- Excludes large genome files, CellRanger outputs, and R workspace files
- CellRanger processing: Raw sequencing data processed using scripts in
sc_gigas_jobs_genomicresources/ - Gene annotation: Annotation files prepared in
annotation_files_script_forMonocle3/ - Monocle3 analysis: Cell clustering, trajectory analysis, and marker identification using scripts in
Monocle_Rscripts_RDS/ - Visualization: Figures generated using various R scripts for manuscript preparation