Bulk Data Access
Bulk data access supports scalable retrieval of patient records across populations for use cases such as analytics, regulatory reporting, and payer operations. Health Gorilla implements the FHIR $export operation to deliver large datasets in structured formats, enabling clients to extract full or filtered views of clinical data at scale.
Use Cases
Bulk export is designed for large-scale access where retrieving individual records is inefficient or impractical.
Common scenarios include:
- Risk adjustment and cohort-based analysis
- Claims validation and encounter reconciliation
- Healthcare Effectiveness Data and Information Set (HEDIS) and quality reporting programs
- Data lake ingestion and archival
Export Types
Health Gorilla supports multiple variants of the $export operation, depending on your patient scope:
GET /fhir/R4/$exportretrieves all patients accessible to your clientGET /fhir/R4/Group/{id}/$exportretrieves data for a specific groupGET /fhir/R4/Patient/{id}/$exportretrieves a full export for a single patient
Each call initiates an asynchronous job and returns a Content-Location header that you can poll to monitor status and retrieve results.
File Format and Delivery
Export results are delivered as newline-delimited JSON (NDJSON), with separate files generated for each FHIR resource type.
For example:
Patient-1.ndjsonEncounter-1.ndjsonObservation-1.ndjson
Each line in the file represents a complete FHIR resource. When the export job completes, your system receives a list of pre-signed download URLs for all available files. Files remain accessible for a limited time, typically between 24 and 72 hours.
Filtering Options
You can limit the size and scope of export jobs using query parameters. This helps you retrieve only relevant data and reduce processing overhead:
_since={timestamp}filters resources modified after the specified date_type=ResourceType1,ResourceType2limits output to the listed resource types_typeFilter=ResourceType?searchParam=valueadds specific filters to one or more resource types
Filtered exports complete faster, generate fewer files, and improve downstream processing performance.
Performance Considerations
Bulk jobs can be resource intensive. To ensure optimal performance:
- Use Group-based exports for defined cohorts rather than exporting your entire population
- Filter by resource type or modification date to limit payload size
- Schedule exports during non-peak usage hours to reduce queue time
- Monitor job status using the Content-Location endpoint to avoid missed download windows
Each tenant is subject to export throttling and queue management to protect system performance.
Summary
Bulk data access allows you to retrieve structured patient data efficiently using the FHIR $export API. This supports high-volume workflows such as quality reporting, claims review, and analytics. Scope your export jobs using filters and patient groups to control output size and improve performance. All exports are delivered in a secure, asynchronous manner with time-limited file availability.