
The UMass Chan Data Lake

The UMass Chan Data Lake combines data from disparate sources, including the UMass Memorial Hospital Electronic Health Record System (EPIC), public health data, patient registries, and administrative data. The EPIC system is the most comprehensive data source available to the research community and covers a variety of data domains.

Data from clinical systems is routinely added to the Data Lake and stored in its native format. It then goes through a data engineering process (Extract, Transform, Load, or ETL) that converts it into a format suitable for analysis. The ETL process extracts data from the source systems, applies data quality and consistency standards, integrates data from separate sources, and delivers the result in the appropriate format for visualization and analytics.
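The ETL stages described above can be illustrated with a minimal Python sketch. It is not the Data Lake's actual pipeline: the record layouts, field names, quality rules, and the `extract`, `transform`, `load`, and `parse_date` helpers are all hypothetical, and in-memory lists stand in for the real source systems and target tables.

```python
from datetime import datetime

# Hypothetical raw extracts; field names and values are illustrative only.
EHR_ROWS = [
    {"patient_id": " 001 ", "birth_date": "1980-03-15", "sex": "f"},
    {"patient_id": "002", "birth_date": "15/03/1975", "sex": "M"},
    {"patient_id": "", "birth_date": "1990-01-01", "sex": "F"},  # missing key
]
REGISTRY_ROWS = [
    {"patient_id": "001", "registry": "diabetes"},
    {"patient_id": "003", "registry": "asthma"},  # no matching EHR record
]


def extract():
    """Extract: return copies of the raw rows from each source."""
    return [dict(r) for r in EHR_ROWS], [dict(r) for r in REGISTRY_ROWS]


def parse_date(text):
    """Normalize the supported date formats to ISO 8601, or return None."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def transform(ehr_rows, registry_rows):
    """Transform: apply quality rules, standardize values, integrate sources."""
    # Index registry memberships by patient so they can be joined to EHR rows.
    registries = {}
    for row in registry_rows:
        key = row["patient_id"].strip()
        registries.setdefault(key, []).append(row["registry"])

    clean = []
    for row in ehr_rows:
        patient_id = row["patient_id"].strip()
        birth_date = parse_date(row["birth_date"])
        if not patient_id or birth_date is None:
            continue  # quality rule: drop rows without a usable key or date
        clean.append({
            "patient_id": patient_id,
            "birth_date": birth_date,
            "sex": row["sex"].upper(),
            "registries": registries.get(patient_id, []),
        })
    return clean


def load(rows, target):
    """Load: append analysis-ready rows to the target table (a list here)."""
    target.extend(rows)
    return len(rows)


analytics_table = []
ehr, registry = extract()
loaded = load(transform(ehr, registry), analytics_table)
```

In this sketch the third EHR row is rejected because it has no patient identifier, the two date formats are normalized to one, and registry membership is joined onto each surviving patient row. A production pipeline would read from the source systems and write to a database, but the three stages keep the same shape.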


Data Lake