Data Services
Data Services acts as a content and technical advisor for projects that use HUS data, and performs data extractions from the HUS Data Lake for purposes such as scientific research and knowledge management.
HUS Data Services
We provide access to the world’s most comprehensive datasets on specialized health care.
We can deliver Real World Data (RWD) for various use cases, efficiently and securely. The most common purposes are scientific research, information management and authorities’ requests.
The HUS Data Lake contains register data from patient information systems in the various specialties at HUS, such as patient visits and inpatient care periods, diagnoses, procedures performed, laboratory examinations, surgical operations, imaging, pathology samples, intensive care and anesthesia. Medical images, signal data and genome data are also available.
You must have a research permit and/or a data access permit to use the data. Instructions on how to apply for a permit are available here. You may submit a preliminary study to Data Services for your research plan if you wish to explore whether data is available or how large the target cohort is. If you intend to process and combine datasets from various data controllers (for example HUS and other wellbeing services counties), the data permit process is managed by Findata.
In the HUS Data Lake, the data from patient registers and certain administrative registers are organized so that they can be leveraged for example for scientific research, knowledge management or patient care. The HUS Data Lake integrates data from more than 100 different patient information systems and quality registers.
The Data Lake is a patient register that compiles a variety of patient-related health information and is a part of the broader HUS patient information system. It complies with the same data security and GDPR requirements as all other patient and administrative information systems used at HUS, and a Data Protection Impact Assessment (DPIA) has been made. The HUS Data Lake does not combine registers; the datasets retrieved from various registers are kept separate by technical means.
The data in the Data Lake are pseudonymized. All datasets in the HUS Data Lake can potentially be joined via pseudonymized identifiers.
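As an illustration of what joining via pseudonymized identifiers can look like in practice, the minimal sketch below links two small extracts on a shared pseudonym. The field names (`pseudo_id`, `icd10`, `test`, `result`) and all sample values are assumptions made for this example; they do not reflect the actual Data Lake schema.

```python
from collections import defaultdict

# Hypothetical extracts; field names and values are illustrative only.
diagnoses = [
    {"pseudo_id": "a1f3", "icd10": "E11.9"},
    {"pseudo_id": "b7c2", "icd10": "I10"},
]
lab_results = [
    {"pseudo_id": "a1f3", "test": "HbA1c", "result": 48},
    {"pseudo_id": "a1f3", "test": "HbA1c", "result": 52},
    {"pseudo_id": "c9d4", "test": "CRP", "result": 5},
]


def join_on_pseudo_id(left, right, key="pseudo_id"):
    """Inner-join two lists of records on a shared pseudonymized identifier."""
    # Index the right-hand records by pseudonym for fast lookup.
    index = defaultdict(list)
    for row in right:
        index[row[key]].append(row)
    # Emit one combined record per matching pair.
    joined = []
    for row in left:
        for match in index.get(row[key], []):
            joined.append({**row, **match})
    return joined


linked = join_on_pseudo_id(diagnoses, lab_results)
```

Only the pseudonym is used as the linking key, so no directly identifying information is needed to combine the two extracts.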
Our work includes finding data for a wide range of use cases, improving the usability and quality of data, investigating and processing new datasets integrated into the Data Lake and generating datasets to add to our service offering. We also validate data, contribute to reporting development and are involved in various projects. Our goal is to provide a seamless service and to deliver high-quality data to customers.
You can submit a data request, a preliminary study request, a cost estimate request or a data transfer request (into a secure user environment) through the Data Portal.
Please submit a preliminary study request (e.g. a report on the volume of the target dataset or data availability; this does not require a permit) before applying for a registry research permit, so that we can ensure that the data required for your research is available.
A well-thought-out specification of your dataset needs at the permit application stage will expedite processing of the application and the creation of the dataset itself.
You can monitor the progress of your data requests and other service requests in the Data Portal.
Datasets subject to the Secondary Use Act will be delivered to a secure user environment (such as HUS Acamedic). We can also deliver datasets as per your data access permit to another audited secure user environment that complies with the Secondary Use Act.
For the time being, HUS has decided to cover the costs of using Data Services for HUS-based research projects. A HUS-based research project is defined as one where the responsible researcher and a significant percentage of the research team members are employed at HUS.
Datasets in the Data Lake
Patient background information
- Demographic information, such as date of birth, date of death, gender, municipality
- Source systems and availability:
- Uranus and Apotti:
- from 2004–2026
- a total of approx. 3.6 million patients' background information
- a total of approx. 1.5 million patients' height and weight information
Diagnosis information
- Diagnosis information recorded in the patient information system, such as ICD–10 diagnosis code, the unit recording the diagnosis, the time of update, whether it is a primary or secondary diagnosis, and the reported start date of the diagnosis
- Source systems and availability:
- Uranus:
- partially from the years 2004-2007
- from the years 2008-2020 approx. 2-6 million diagnoses annually
- Apotti:
- partially from the year 2020
- from 2021 onwards approx. 16 million diagnoses annually
Information regarding visits
- Information concerning the patient's visit, such as the time of the visit, healthcare unit, specialty, type of visit, primary diagnosis, and place for follow-up care
- Source systems and availability:
- Uranus:
- partly from the years 2004–2008
- from the years 2009–2020 about 3 million visits annually
- Apotti:
- gradually starting from the year 2018
- since 2021, over 4 million visits annually
- appointment information from the year 2021 onwards
Information concerning inpatient care periods
- Information recorded during inpatient care, start and end moments of inpatient care, healthcare unit, admission and subsequent care unit, specialty, main diagnosis and main procedure
- Source systems and availability:
- Uranus:
- partly from the years 2004–2008
- from the years 2009–2019 approx. 200,000 inpatient care periods annually
- Apotti:
- since 2018
- since 2021, over 400,000 inpatient care periods annually
Patient medical record texts
- Patient report texts, statements, specialty (tab), and care report texts recorded in patient information systems
- Source systems and availability:
- Uranus:
- partly from the years 2002–2004
- from the years 2005–2020 approx. 5 million patient record texts annually
- Apotti:
- since 2021, approximately 7 million patient record texts annually
- SERI data have been removed from the dataset
For privacy reasons, text requests should be narrowed in the data request, for example by keyword or specialty
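To make the idea of narrowing by keyword or specialty concrete, here is a minimal sketch of such a filter. The record layout (`specialty`, `text`), the function name `narrow_texts` and the sample rows are hypothetical and serve only to illustrate the principle; they are not the Data Lake's actual structure or tooling.

```python
# Hypothetical patient record texts; contents are invented for illustration.
records = [
    {"specialty": "Neurology", "text": "Patient reports recurrent migraine."},
    {"specialty": "Neurology", "text": "Follow-up visit, no complaints."},
    {"specialty": "Cardiology", "text": "Migraine noted in history."},
]


def narrow_texts(records, keywords=(), specialties=()):
    """Keep records in a listed specialty whose text mentions a keyword.

    An empty filter means "no restriction" for that criterion.
    """
    lowered = [k.lower() for k in keywords]
    wanted = set(specialties)
    selected = []
    for rec in records:
        if wanted and rec["specialty"] not in wanted:
            continue
        text = rec["text"].lower()
        if lowered and not any(k in text for k in lowered):
            continue
        selected.append(rec)
    return selected


narrowed = narrow_texts(records, keywords=["migraine"], specialties=["Neurology"])
```

Combining both criteria keeps the delivered text volume as small as the research question allows.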
Laboratory examination data
- Data stored in the laboratory system, such as the laboratory test number, sample collection time, ordering unit, and test result
- Source systems and availability:
- Multilab: since 2000, approximately 10–35 million laboratory tests annually
Procedure details
- Entries related to the procedure, such as the date and procedure code, whether it is a primary or secondary procedure
- Urgency of the surgical procedure, duration of the surgery, tools used, and type of anesthesia
- Source systems and availability:
- Uranus:
- partly from the years 2004–2008
- from the years 2009–2019 approximately 1–2 million procedures annually
- Opera:
- partly from the years 2005–2009
- from the years 2010–2020 approx. 100,000 surgeries annually
- Apotti:
- since 2021, 3–4 million procedures annually
- since 2021 about 160,000 surgery records annually
Pathology examination data
- Pathology examinations and their results as well as statements, using the SNOMED code set
- Source systems and availability:
- Qpati:
- from the years 1987–1993 approximately 16,000–65,000 samples annually
- from the years 1994–2021 approximately 90,000–200,000 samples annually
- from the years 2022-2024 approximately 30,000-100,000 samples annually
- My+:
- since 2020, approximately 150,000 samples annually
Medication information
- Medication orders and prescriptions recorded for the patient, medication administration records
- Source systems and availability:
- Uranus:
- from the years 2012–2020 approximately 1–4 million medication orders or prescriptions annually
- 1–4 million medication administrations recorded annually
- Kemokur:
- annually around 60,000 patient treatment courses and administration entries from the years 2014–2020
- Apotti:
- from 2019 onwards 1–8 million medication orders or prescriptions annually
- since 2021, approximately 22 million medication administrations recorded annually
- Marela:
- since 2004 approximately 700,000 medication orders annually
Intensive care and anesthesia information
- Data recorded in intensive care systems
- Monitor data available in limited quantities
- Source systems and availability:
- Caresuite Picis82: from the years 2012–2020
- Clinisoft Jorvi: from the years 2001–2019
- Clinisoft Haartman Malmi: from the year 2019
- Caresuite Picis80: from the years 2009–2014
- Caresuite Meipicis: from the years 2003–2009
- Caresuite Peipicis: from the years 2006–2009
- Caresuite Toopics: from the years 2005–2009
- Clinisoft LNS kix: from the years 1999–2019
- Clinisoft LNS kvii: from the years 1999–2019
- Clinisoft Meilahti: from the years 1999–2019
- Apotti: from 2020 onwards
- Intensive Care Quality Registry: from 1998 onwards
Emergency medical service alarm tasks
- Information on first response measures and patient transports
- Information recorded in emergency tasks, such as the emergency vehicle's timestamps, the patient's medication, and records
- Source systems and availability:
- Merlotmedi:
- partly from the years 2007–2012
- since 2013, approximately 100,000–200,000 emergency tasks annually
Imaging study data
- Information regarding imaging studies, such as the date, study number, referral, visit, and report details
- Image data available from the PACS server
- Source systems and availability:
- Mustiradu:
- partially from the years 1996–1998
- from the years 1999–2013 approximately 200,000–1,000,000 imaging studies annually
- HUSRadu:
- partially from 2013
- since 2001 approximately 400,000–800,000 studies annually
- replaced by Apotti 2021–2022
- Apotti:
- partially from 2021
- from 2022 onwards about 900,000 studies annually
Birth information
- Basic information of the newborn, such as time of birth, mode of delivery, measurements, gestational age, and Apgar scores
- Source systems and availability:
- Obstetrix: from the years 2005–2019
- Apotti from 2019 onwards
- From 2006 onwards, approximately 14,000–18,000 births annually
Patient's basic information and measurement results
- Values about the patient systematically recorded in the care chart, such as weight, height, blood pressure, and temperature, as well as their recording unit and time
- Source systems and availability:
- Uranus:
- from the years 2013–2020 approx. 10 million entries annually
- Apotti:
- since 2021, approximately 200–400 million tracking data entries annually
- Weight, height and BMI compiled from different source systems:
- from the years 2006–2009 approx. 50,000–100,000 measurements annually
- since 2020 about 200,000–800,000 measurements annually
- Blood pressure
- since 2007, approximately 5–22 million measurements annually
BCB quality register information
- Registers from which data are available:
- [Aivokasvain]: Brain tumor: approximately 5,200 patients since 2019
- [Astma]: Asthma: a total of approximately 12,000 patients since 2018
- [AVH (Aivoverenkiertohäiriö)]: Stroke (Cerebrovascular disorder): approximately 57,000 patients since 2015
- [Kaksisuuntaisen mielialahäiriön]: Bipolar disorder treatment registry: approximately 700 patients from 2018–2020
- [C–hepatiitti]: Hepatitis C: approximately 1,300 patients since 2019
- [Diabetes]: approximately 9,200 patients since 2018
- [Elinsiirrot]: Organ transplants: approximately 27,000 patients since 2015
- [Elvytys]: Resuscitation: a total of approximately 8,000 patients since 2015
- [Elvytys-MET (MET-elvytystiimi)]: Resuscitation-MET (MET resuscitation team): approximately 5,000 patients from 2015–2021
- [Epilepsia]: Epilepsy: approximately 7,800 patients since 2018
- [Eturauhanen]: Prostate: approximately 8,400 patients since 2017
- [Eturauhanenonk]: Prostate cancer: approximately 3,600 patients since 2017
- [Glaukooma]: Glaucoma: approximately 700 patients since 2021
- [Gynekologiset syövät]: Gynecological cancers: approximately 11,700 patients since 2018
- [Haava]: Wound: approximately 700 patients since 2021
- [Harvinaissairaudet]: Rare diseases: approximately 6,600 patients since 2017
- [Husuke (pään ja kasvojen epämuodostuma)]: Husuke (head and facial malformation): approximately 11,000 patients since 2016
- [IBD]: Inflammatory bowel disease: approximately 10,700 patients since 2016
- [Ihosyöpä]: Skin cancer: approximately 27,500 patients since 2017
- [Ihosyöpäonk]: Skin cancer oncology: approximately 800 patients since 2017
- [Implantdb]: Implant database: approximately 40,000 patients from 2007–2021
- [Infcare]: Infection care: approximately 2,000 patients since 2013
- [Kaihi]: Cataract: approximately 77,000 patients since 2014
- [Katetriläppä (TAVI)]: Catheter valve (TAVI): approximately 750 patients from 2017–2021
- [Keuhkosyöpä]: Lung cancer: approximately 2,800 patients since 2019
- [Kieleke]: Flap: approximately 3,000 patients since 2019
- [Laskimo]: Vein: approximately 11,100 patients since 2017
- [Lasten ja nuorten psykiatria]: Child and adolescent psychiatry: approximately 300 patients from 2018–2020
- [Lasten ja nuorten syöpä]: Child and adolescent cancer: approximately 300 patients from 2016–2021
- [Lasten murtumat]: Children's fractures: approximately 34,500 since 2016
- [Lasten selkä]: Children's back: approximately 1,600 patients since 2015
- [Lasten syöpä]: Children's cancer: approximately 1,800 patients since 2018
- [Lihavuus]: Obesity: approximately 4,900 patients since 2014
- [Lonkka]: Hip: approximately 32,000 patients since 2018
- [Lymfoomaonk]: Lymphoma oncology: approximately 3,600 patients since 2018
- [Munuaissyöpä]: Kidney cancer: approximately 1,800 patients since 2018
- [Munuaissyöpäonk]: Kidney cancer oncology: approximately 1,400 patients since 2017
- [Murtuma]: Fracture: approximately 23,500 patients since 2018
- [Nefrologia]: Nephrology: approximately 6,000 patients since 2015
- [Nenä]: Nose: approximately 19,000 patients since 2019
- [Neuromodulaattori]: Neuromodulator: approximately 1,800 patients since 2018
- [Opioidikorvaushoito]: Opioid substitution treatment: approximately 600 patients since 2021
- [Pään ja kaulan alueen syövät]: Head and neck cancers: approximately 14,000 patients since 2018
- [Pään ja kaulan alueen syövät onk]: Head and neck cancer oncology: approximately 3,600 patients since 2018
- [Pad]: PAD: approximately 70,000 patients since 2016
- [Invasiivikardilogia (PCI/ANGIO-rekisteri)]: Invasive cardiology (PCI/ANGIO registry): approximately 21,000 patients from 2017–2021
- [Polvi]: Knee: approximately 34,700 patients since 2018
- [Psykoosi]: Psychosis: approximately 600 patients from 2018–2020
- [Psykoterapia]: Psychotherapy: approximately 20,000 patients since 2018
- [Rakkosyöpäonk]: Bladder cancer oncology: approximately 160 patients from 2017–2019
- [Rektumkarsinooma]: Rectal carcinoma: approximately 10,000 patients since 2014
- [Rektumkarsinoomaonk]: Rectal carcinoma oncology: approximately 3,000 patients since 2017
- [Reuma]: Rheumatism: approximately 55,000 patients since 2015
- [Rintasyöpä]: Breast cancer: approximately 17,500 patients since 2015
- [Rintasyöpäonk]: Breast cancer oncology: approximately 14,300 patients since 2015
- [Sarkooma ja gist]: Sarcoma and GIST: approximately 390 patients since 2020
- [Sarveiskalvo]: Cornea: approximately 1,000 patients since 2020
- [Selkä]: Spine: approximately 37,000 patients since 2016
- [Suonianomalia]: Vascular anomaly: approximately 4,000 patients since 2016
- [Sydänkirurgia]: Cardiac surgery: approximately 24,000 patients since 2017
- [Sydänpysähdys]: Cardiac arrest: approximately 7,300 patients since 2020
- [Syv (selkäydinvamma)]: SCI (spinal cord injury): approximately 2,400 patients since 2015
- [Tahdistin]: Pacemaker: approximately 38,000 patients since 2014
- [Tissuedb]: Tissuedb: approximately 6,300 patients since 2007
- [Tyrä]: Hernia: approximately 42,000 patients since 2016
- [Urogynekologia]: Urogynecology: approximately 33,000 patients since 2016
- [Uroteelisyöpä]: Urothelial cancer: approximately 7,300 patients since 2018
- [Verisuoni]: Circulatory system: approximately 31,000 patients since 2015
- [Verkkokalvo]: Retina: approximately 24,000 patients since 2015
- [Ylage]: Ylage: approximately 14,600 patients since 2019
Tips from Data Services for research permit applications
Always request a preliminary study first
Always request a preliminary study before applying for a permit for register-based research. A preliminary study investigates, for example, the size of the target group and the availability of data. It requires no research permit and incurs no cost, and it eliminates unpleasant surprises such as discovering after submitting a research permit application that essential data for the research are not available.
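The kind of question a preliminary study answers can be sketched as a simple cohort count. The example below is only an illustration under stated assumptions: the row layout (`pseudo_id`, `icd10`, `recorded`), the function name `cohort_size` and the sample rows are hypothetical, and the actual preliminary study is carried out by Data Services.

```python
from datetime import date

# Hypothetical diagnosis rows; identifiers, codes and dates are invented.
diagnosis_rows = [
    {"pseudo_id": "a1f3", "icd10": "I63.9", "recorded": date(2021, 3, 2)},
    {"pseudo_id": "a1f3", "icd10": "I63.9", "recorded": date(2022, 1, 15)},
    {"pseudo_id": "b7c2", "icd10": "I10", "recorded": date(2021, 6, 30)},
    {"pseudo_id": "c9d4", "icd10": "I63.4", "recorded": date(2019, 11, 5)},
]


def cohort_size(rows, icd10_prefix, start, end):
    """Count distinct patients with a matching diagnosis recorded in [start, end]."""
    patients = {
        row["pseudo_id"]
        for row in rows
        if row["icd10"].startswith(icd10_prefix)
        and start <= row["recorded"] <= end
    }
    return len(patients)


size = cohort_size(diagnosis_rows, "I63", date(2020, 1, 1), date(2022, 12, 31))
```

Counting distinct pseudonyms rather than rows avoids overstating the cohort when a patient has several matching diagnoses.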
Describe the register data required accurately – use the Dataset Catalog
Please describe the register data you require as accurately as possible in your permit application documentation. You should use the HUS Dataset Catalog, a compilation of some of the available datasets. (Dataset catalogs in other hospital districts may also be useful.) Ensuring that your research plan is consistent with your dataset request helps avoid unnecessary filing for amendments and delays in delivering the datasets.
Indicate in your research plan and consent form that you will be using register data
If you are requesting datasets from Data Services under a research permit and the study is based on patient consent, you must indicate in your research plan and patient consent form that register data will be included in your research materials.
Keep your research group details up to date
Details of your research group must be kept up to date, because only persons designated as research group members on Tutkijan työpöytä are allowed to access research materials in the study. Please add any new group members to the research group on Tutkijan työpöytä and submit the amendment application so that the processors become aware of them. You should also contact your research secretary. You must also update your research group details whenever a member leaves the group. Only persons named on the research permit will be issued access rights to HUS Acamedic.
Prices
data-services-price-list.pdf (pdf 145.03 KB)
Contact information
Updated: 15.06.2026