Research Room Access with Specified Datasets
Gain secure access to a dedicated research environment with tailored datasets, enabling in-depth analysis under controlled conditions. This service ensures compliance with data protection standards while giving researchers the flexibility to explore complex clinical questions with precision.

Execution of Client-Developed Algorithms on Our Database
Leverage our comprehensive, depersonalized database by running your own algorithms securely within our infrastructure to generate customized insights. This approach combines your expertise with our high-quality data resources, enabling advanced analytics without compromising confidentiality.

Provision of Data Research Services
Benefit from our expert-led data research services, providing rigorous analysis and actionable results to support evidence-based decision-making. Our team works closely with partners to translate complex datasets into clear findings, tailored to the specific needs of each project.

Core Facility Services @ Semmelweis University
Clinical Data Center
There is a growing demand for the secondary use, in research and analysis, of data accumulated over decades in healthcare IT systems. Such reuse is complicated by the heterogeneity of the data, by the fact that the data were collected for non-research purposes, by language barriers, and by the lack of standardization. Semmelweis University's Clinical Data Center was built to address these challenges by following the Observational Medical Outcomes Partnership (OMOP) standards and by using large language models (LLMs) to process free-text documentation. Over the past few years, these tools have been used in practice to create a system that is suitable for both domestic and international collaborations.

Summary of data assets
Patient population
Demography
• 56.3% female (1,401,420)
• Paediatric (below 18 years): 321,384
• Adults (18–64 years): 1,273,364
• Elderly (65+ years): 744,300
• Oldest old (85+ years): 174,182
Timeframe & Updates
• Coverage: 2011 – October 2025
• Updates: every 3 months
Data Types
• Demographics
• Case-level data
• Diagnoses & interventions
• Imaging & laboratory results
• Prescriptions
Standards
OMOP Common Data Model (CDM)
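For researchers who have not worked with OMOP-structured data, the following minimal sketch illustrates how a demographic summary like the one above could be derived from an OMOP-style person table. It is an illustration only: the in-memory toy table, its four rows, and the helper names (age_band, summarize, build_toy_database, reference_year) are hypothetical and do not describe the Clinical Data Center's actual schema or access method. The column names and the gender concept IDs (8532 for female, 8507 for male) follow the public OMOP CDM conventions.

```python
import sqlite3

# Standard OMOP concept IDs for gender.
FEMALE_CONCEPT_ID = 8532
MALE_CONCEPT_ID = 8507


def age_band(age: int) -> str:
    """Map an age in years to the bands used in the demography summary."""
    if age < 18:
        return "paediatric"
    if age < 65:
        return "adult"
    return "elderly"


def summarize(conn: sqlite3.Connection, reference_year: int) -> dict:
    """Count persons per age band and compute the share of female patients."""
    rows = conn.execute(
        "SELECT gender_concept_id, year_of_birth FROM person"
    ).fetchall()
    counts = {"paediatric": 0, "adult": 0, "elderly": 0}
    female = 0
    for gender_concept_id, year_of_birth in rows:
        counts[age_band(reference_year - year_of_birth)] += 1
        if gender_concept_id == FEMALE_CONCEPT_ID:
            female += 1
    female_share = female / len(rows) if rows else 0.0
    return {"counts": counts, "female_share": female_share}


def build_toy_database() -> sqlite3.Connection:
    """Create an in-memory table with a few invented rows (illustration only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE person ("
        "person_id INTEGER PRIMARY KEY, "
        "gender_concept_id INTEGER NOT NULL, "
        "year_of_birth INTEGER NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO person VALUES (?, ?, ?)",
        [
            (1, FEMALE_CONCEPT_ID, 2015),
            (2, MALE_CONCEPT_ID, 1980),
            (3, FEMALE_CONCEPT_ID, 1950),
            (4, FEMALE_CONCEPT_ID, 1990),
        ],
    )
    return conn


if __name__ == "__main__":
    print(summarize(build_toy_database(), reference_year=2025))
```

Because the table and column names come from the shared data model, an analysis written this way is, in principle, portable to other OMOP-conformant databases, which is the main practical benefit of the standard for collaborations.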
Access & Usability
• Data delivery in 1–2 weeks (depending on complexity)
• Fully depersonalized / anonymized dataset
