Getting Data Retrieval Status
This endpoint provides comprehensive information about the data retrieval progress, including retrieval status for each healthcare location and vectorization status.
How Data Retrieval Status Works
When a patient consents to share their health data, HealthEx initiates a data retrieval process that may fetch records from multiple healthcare locations. This API allows you to monitor the progress of that retrieval and see the vectorization status (which indicates whether the data is ready for AI-powered queries).
Authentication
This endpoint requires a valid Patient Token (either full access or read-only).
See the Authentication guide for more information on obtaining tokens.
Making the API Call
To get the data retrieval status:
GET https://api.healthex.io/v1/patients/data-retrieval/status
Accept: application/json
Authorization: Bearer <patient JWT token>
This API requires you to pass a patient JWT token to authenticate. See the Authentication guide for more info.
Response Format
If the request is successful, we will respond with a 200 status code. The response body will look something like this:
{
"patientId": "17d4513f-73e8-4b2a-9c1d-5e6f7a8b9c0d",
"retrievalId": "batch-abc123-def456",
"dataRetrievalStatus": "IN_PROGRESS",
"vectorizationStatus": "IN_PROGRESS",
"createdAt": "2024-01-15T10:30:00.000Z",
"updatedAt": "2024-01-15T11:45:00.000Z",
"locations": [
{
"locationId": "loc-hospital-123",
"locationName": "General Hospital",
"locationLogoUrl": "https://example.com/logo.png",
"locationSource": "TEFCA_DOCUMENT",
"dataRetrievalStatus": "COMPLETE",
"updatedAt": "2024-01-15T11:30:00.000Z",
"documentCount": 5
},
{
"locationId": "loc-clinic-456",
"locationName": "Downtown Clinic",
"locationLogoUrl": null,
"locationSource": "USER_SELECTED",
"dataRetrievalStatus": "IN_PROGRESS",
"updatedAt": "2024-01-15T11:45:00.000Z",
"documentCount": 0
}
]
}
Response Fields
patientId: The patient's unique identifierretrievalId: Unique identifier for this retrieval batchdataRetrievalStatus: Overall status of the data retrieval (see values below)vectorizationStatus: Status of data vectorization for AI queries (see values below)createdAt: ISO 8601 timestamp of when the retrieval was initiatedupdatedAt: ISO 8601 timestamp of the most recent update across all locations and vectorization (null if never updated)locations: Array of location-specific retrieval statuses
Location Fields
Each location in the locations array contains:
locationId: Unique identifier for the healthcare locationlocationName: Human-readable name of the location (may be null)locationLogoUrl: URL to the location's logo image (may be null)locationSource: How this location was discovered (see values below)dataRetrievalStatus: Retrieval status for this specific locationupdatedAt: ISO 8601 timestamp of the last update for this location (may be null)documentCount: Number of documents retrieved from this location
Overall Data Retrieval Status Values
The overall dataRetrievalStatus field indicates the status of the current retrieval:
IN_PROGRESS: The retrieval is still active and not yet finalizedCOMPLETE: The retrieval has been finalized, meaning all processing is complete
Location Data Retrieval Status Values
Each location's dataRetrievalStatus field indicates the retrieval progress for that specific healthcare location:
NOT_STARTED: Retrieval has not begun for this locationIN_PROGRESS: Data is currently being retrieved from this locationCOMPLETE: Data retrieval completed successfully for this locationPARTIAL_COMPLETE: Some documents were retrieved, but the process encountered issuesSKIPPED: This location was skipped (e.g., patient opted out)ERROR: An error occurred during retrieval from this location
Location Source Values
The locationSource field indicates how the healthcare location was discovered:
TEFCA_DOCUMENT: Discovered through a TEFCA document queryTEFCA_ENDPOINT: Discovered through a TEFCA endpoint queryUSER_SELECTED: Manually selected by the patient
Vectorization Status Values
The vectorizationStatus field indicates whether the retrieved data has been processed for AI-powered queries:
IN_PROGRESS: Data is being prepared for AI queries (includes queued, waiting, and actively processing states)COMPLETE: Vectorization completed successfully - data is ready for AI-powered queriesFAILED: Vectorization encountered an error
Error Responses
We may return errors in the following scenarios:
No Retrieval Found
HTTP 404 - No data retrieval request was found for the patient. This may mean the patient has not initiated a data retrieval process yet, or the retrieval was aborted before document selection was completed.
Unauthorized
HTTP 401 - The token is missing, invalid, or expired.
Forbidden
HTTP 403 - The token does not have permission to access this endpoint. This typically happens when the token is not a patient token (PATIENT or PATIENT_READ_ONLY type).
Usage Notes
- This endpoint only returns the most recent data retrieval batch for the patient where document selection was completed
- The
updatedAttimestamp reflects the latest update across all locations and vectorization processing - When
vectorizationStatusisCOMPLETE, the patient's data is ready for AI-powered queries via the MCP server
See Also
- Authentication: Learn how to set up API credentials and manage tokens
- Accessing Patient Data: Understand how to access patient data once retrieval is complete
- HealthEx MCP Server: Use the MCP server for AI-powered queries once vectorization is complete