Elucidata and Sapien Biosciences Form Strategic Partnership to Transform India's Largest Biobank into AI-Ready Data Platform
- Elucidata and Sapien Biosciences announced a strategic partnership to convert Sapien's 300,000+ patient sample biobank into AI-ready, multimodal data products for drug discovery and diagnostic development.
- The collaboration will focus initially on building AI models that infer genomic and transcriptomic insights from digital pathology images using NGS-annotated slides from Sapien's cancer biobank.
- Sapien operates one of the top 10 biobanks globally with over 85,000 cancer patients and represents the largest integrated resource for Asian patient data across multiple disease areas.
- The partnership aims to address underrepresentation in global omics datasets while accelerating precision medicine through synthetic clinicogenomic data generation.
Elucidata, a leader in data-centric AI for drug discovery and translational research, and Sapien Biosciences, India's first and largest commercial biobank, have announced a strategic partnership to transform Sapien's extensive biobank assets into AI-ready, multimodal data products for drug and diagnostic development. The collaboration aims to maximize the value of biobank data through AI-assisted harmonization, leading to precise patient outcomes at scale and lowering the cost of drug discovery.
Sapien Biosciences, founded in partnership with Apollo Hospitals, operates at unprecedented scale with access to over 300,000 patient samples across oncology, cardiology, autoimmune, inflammation, neurology, and other diseases. This includes more than 85,000 cancer patients, increasingly paired with digitized histopathology images, genomic profiles, and structured longitudinal clinical data, making it one of the top 10 biobanks globally and the largest integrated resource for Asian patient data.
Through this collaboration, Sapien will harness Elucidata's platform to integrate, standardize, and enrich its vast repository of biospecimens and curated clinical data, making them accessible for predictive modeling, synthetic data generation, and advanced genomics research.
"At Sapien, we believe that deeply characterized patient data especially from underrepresented populations like those in India can catalyze more inclusive and effective diagnostics and therapies worldwide," said Dr. Jugnu Jain, CEO and Co-founder of Sapien Biosciences. "This partnership with Elucidata enables us to convert our rich biological and clinical datasets into interoperable, AI-ready formats, unlocking their potential for translational research, patient stratification, and precision drug development."
The first phase of the collaboration will focus on building AI models that infer genomic and transcriptomic insights from digital pathology images. By using NGS-annotated slides from Sapien's cancer biobank, the project will enable the generation of synthetic multi-modal datasets essential for next-generation diagnostic tools and tissue-sparing research strategies. This approach is particularly valuable for rare cancers and limited biopsy specimens.
"Sapien's scale, sample quality, and data depth make it a critical partner in our mission to democratize access to high quality, AI-ready biomedical data," said Dr. Abhishek Jha, CEO and Co-founder of Elucidata. "By applying our Polly platform to Sapien's datasets, we can bridge the gap between fragmented sample collections and next-gen AI models that accelerate target discovery and biomarker validation."
Beyond oncology, the partnership will explore multi-disease use cases across Sapien's biobank, including cardiovascular, autoimmune, and neurological disorders, building synthetic clinicogenomic datasets that support pharma R&D and companion diagnostic development. The Polly platform integrates EHRs, genomics, imaging, and clinical trial data for seamless downstream analysis and model development.
"This partnership between Elucidata and Sapien Biosciences brings together Sapien's extensive, high-quality biobank resources and Elucidata's expertise in AI-powered data integration," said Dr. Navjot Singh, advisor of Elucidata.io and healthcare leader with 22 years of expertise in strategy, technology, and innovation, most recently as a Senior Partner at McKinsey & Company. "By making rich biospecimens and clinical datasets AI-ready, this collaboration has the potential to accelerate global innovation and help reduce the complexity of human biology for research and patient care."
Elucidata recently ranked 7th in the Broad Institute's Autoimmune Disease Machine Learning Challenge and won the National Cancer Institute's Cancer Research Data Commons AI Data Readiness Challenge, highlighting its leadership in data-centric AI for life sciences. The company supports over 70 pharma and diagnostics clients and has contributed to 16 drug programs progressing toward FDA approvals.
Sapien Biosciences has published over 10 peer-reviewed studies using Indian patient data and collaborates with leading diagnostic and pharma companies to reduce the representation gap in global omics datasets. With access to more than 300,000 patients' samples including 2 million pathology samples, Sapien offers FFPE tissues, blood, and liquid biopsy samples along with annotated histopathology slides and longitudinal clinical data.

Stay Updated with Our Daily Newsletter
Get the latest pharmaceutical insights, research highlights, and industry updates delivered to your inbox every day.
Related Topics
Reference News
[1]
Elucidata and Sapien Biosciences Announce Strategic Partnership to Transform Biobank ...
silicon.co.uk · Jun 5, 2025
[2]
Elucidata and Sapien Biosciences Announce Strategic Partnership ...
finance.yahoo.com · Jun 4, 2025
[3]
Elucidata and Sapien Biosciences to transform biobank assets into ...
biospectrumindia.com · Jun 5, 2025
[4]
Elucidata Partners with Sapien Biosciences to Transform Biobank ...
hyper.ai · Jun 5, 2025