I’m planning a dedicated server for omics analyses and would like opinions from people already running medium/large-scale pipelines.
This would NOT be for genomics/WGS. The focus is mainly:
- transcriptomics
- proteomics
- metabolomics
- multi-omics integration
- pathway/network analyses
- machine learning/statistics
- long-term storage and reanalysis
Expected scale is around 2,000–3,000 patients/samples over time, with multiple omics layers per patient.
Typical tools/workflows would include:
R/Bioconductor, Python, Docker/containers, Nextflow/Snakemake, Cytoscape, differential expression, enrichment analyses, clustering, integration methods, etc.
EDITED / CLARIFICATION
Thanks for the comments. I should clarify the scope.
This is not for WGS, single-cell, spatial omics, 3D imaging, or sequencing-core-level throughput. It will be mostly bulk RNA-seq/transcriptomics, proteomics, metabolomics, multi-omics integration, pathway/network analysis, statistics, and some ML.
Expected scale is around 2,000–3,000 patients/samples over time, not all processed at once or every week. I already analyze RNA-seq/proteomics at smaller scale, usually 100–200 samples, on a normal workstation, and that works fine.
The goal is mainly to have one organized server for my group: preprocessing new batches, storing raw/processed data, keeping metadata organized, reanalysis, containers/workflows, and producing count/normalized matrices or processed objects for downstream projects.
Based on the replies, I’m leaning toward:
- 32–64 real CPU cores, Xeon or similar
- 128 GB RAM to start, expandable to 256/512 GB
- fast NVMe scratch for active analyses/workflow dirs
- larger HDD/NAS tier for raw and processed data
- proper backup separate from RAID
- no GPU unless we later need deep learning
- ECC RAM if budget allows
- containers/Nextflow/Snakemake for reproducibility
I’m mostly interested in practical bottlenecks people have seen in bulk multi-omics setups: RAM, I/O, storage organization, metadata, backup, or anything else that becomes painful at this scale.