Building a Unified AAC Research Data Infrastructure for the NHS
Specialised AAC services across the NHS hold valuable data on the people they support, the interventions provided, and the technology prescribed. However, this data is fragmented across local SQL databases, Word/PDF case notes, and file storage systems. This fragmentation makes it difficult to answer critical national questions such as:
- Who are the clients currently using AAC?
- What is the distribution of diagnoses (e.g., cerebral palsy GMFCS III–V, MND bulbar vs spinal onset)?
- Which equipment types, access methods, and symbol sets are most commonly used?
- What are the average times from referral → assessment → provision?
- Where are the bottlenecks, gaps, or inequities in service delivery?
This project proposes to build a national AAC research data layer that unifies data from multiple services into a consistent, de-identified, and ethically governed system. By combining structured (SQL) and unstructured (Word/PDF) sources into a standard model, and embedding a Trusted Research Environment (TRE) with the HDR UK Data Access Agreement (DAA), the project will provide an independent, transparent basis for answering service planning and equity questions across the UK.
- Unify Data Sources: Connect to local SQL databases and file stores across services, extracting structured and unstructured data.
- Standardise & De-identify: Transform all data into a Common AAC Research Model (CARM), with pseudonymisation at source.
- Strengthen Governance: Implement the HDR UK DAA template within the TRE, ensuring clear, standardised data access processes.
- Enrich Clinical Understanding: Apply text mining and rule-based coding to extract structured diagnosis categories (e.g., GMFCS levels, MND onset type).
- Support National Queries: Provide an independent query interface, enabling researchers and commissioners to explore patterns in AAC usage, provision, and timelines.
- Highlight Gaps & Inequities: Identify where services struggle with provision timelines, access to technology, or equitable distribution of resources.
- HDR UK DAA adoption project (£30k grant, Feb–Aug 2026):
- Gap analysis of current contracts vs DAA template.
- Population of annexes, SOP updates, training of IG/contracts staff.
- Pilot AAC dataset access via TRE under DAA.
- Collect impact metrics: time from approval → DAA signature, contract iteration counts, qualitative ease-of-use.
- Output: TRE operational with DAA in place; one pilot AAC dataset accessible under new governance.
- Ingestion of structured SQL + unstructured file data into Spark/Delta.
- Build bronze → silver → gold tables:
- Persons (de-identified), Encounters, Provisions, Notes, Diagnoses.
- Apply rule-based dictionaries (GMFCS, MND subtypes).
- Establish de-identified gold dataset in TRE.
- Run initial analyses:
- Client mix (diagnoses, age, region).
- Timelines (referral → assessment → provision).
- Equipment/access method distributions.
- Equity across IMD quintiles and regions.
- Independent evaluation of data quality, accuracy of diagnosis inference, and TRE usability.
- Apply for larger NIHR i4i or SBRI Healthcare calls (~£300–500k) to extend to all specialised AAC services.
- Integrate into NHS England commissioning metrics.
- Publish annual AAC service equity and access reports.
- Pseudonymisation at source using HMAC on NHS numbers; no direct identifiers exported.
- Trusted Research Environment (TRE) holds only de-identified gold data.
- Data Access Agreement (DAA) governs researcher access, embedded via HDR UK grant.
- Transparency standards: publish data dictionary, governance processes, and small-n suppression rules.
- Ethics: independent governance board with patient/AT user representation.
- Immediate (2025): HDR UK TRE/SDE DAA Adoption Grant (£30k, Feb 2025–Aug 2026).
- Mid-term (2026–27): NIHR i4i PDA or FAST awards for R&D of middleware + multi-site deployment (£200–500k).
- Challenge-led (2026–28): SBRI Healthcare themed calls (productivity, equity, digital data) to fund national scaling.
- Complementary: HDR UK small calls (TRE infrastructure), Health Foundation/charitable funds for user involvement, and local ICB contributions for service deployment.
- Provide the first unified national picture of AAC service users in the NHS.
- Enable evidence-based planning of provision models, workforce, and funding.
- Support equity monitoring, ensuring clients across diagnoses, geographies, and socioeconomic groups have equal access.
- Allow commissioners to identify where provision delays or equipment mismatches occur and act proactively.
- Demonstrate a governance-first approach by embedding the HDR UK DAA, creating a replicable model for other datasets.