Slides
On Jan 29th 2026 I gave a talk on Data for AI at the Enterprise Tech Monthly (ETM) meetup, hosted by LSEG in London.
Here are the slides: Deconstructing an LLM: What data is inside?
Event information: https://www.eventbrite.co.uk/e/deconstructing-an-llm-what-data-is-inside-tickets-1980199014558 (sign up for future ETM events!)
I gave a 10-minute version of the talk last summer: https://youtu.be/hhbH375tV84
Get Involved!
Here are some links to the topics I talked about:
Open Data Institute (ODI) Solid (Social Linked Data)
Email the Solid team at the Open Data Institute on solid@theodi.org
Or read more at https://solidproject.org/ or https://theodi.org/what-we-do/solid/
EDM Association: AI, Data & Analytics Controls (ADAC)
The ADAC Working Group meets Mondays 3pm UK time. Contact Oli on LinkedIn to be added to the calendar invite, or register directly using the EDM Association ADAC weekly Zoom meeting link:
https://us06web.zoom.us/j/85769777904?pwd=Jy0NrVbU1eO1MfCR87qKLQpzwzrZQt.1
Please click the link to register in advance for your individual Zoom meeting link.
EDM Association: Data Products (DPROD)
The DPROD group meets weekly on Thursdays 4.30pm UK time. Contact Tony Seale to be invited to the Zoom meeting.
FinOS CALM & CCC
Fintech Open Source Foundation (FinOS) - Common Architecture Language Model (CALM)
The Architecture as Code working group develop CALM and meet monthly on Tuesdays 4-5pm. See the FinOS community calendar for the registration link.
CALM repo: https://github.com/finos-labs/architecture-as-code
Common Cloud Controls (CCC)
A FinOS project for establishing a common set of automated cloud controls for regulated industries.
Introductory white paper: https://github.com/finos/common-cloud-controls/blob/main/docs/Citi-Contributed-White-Paper/Financial-Services-Common-Cloud-Controls-Standard-v1.0-(for%20publication).pdf
Total Neural Enterprises (TNE.ai)
TNE.ai build enterprise AI that uses CDMC, ADAC and Ai2 OLMo. Get in touch with Oli Bage on LinkedIn to learn more.
Allen Institute for AI (Ai2) builds the Open Language Model (OLMo) family of frontier models, using open training data. https://allenai.org/olmo
I traced the lineage of the training data using Solidatus, an industry-leading lineage visualisation tool. Learn more about Solidatus here: https://www.solidatus.com/
Play with the OLMo2 lineage model, hosted on the EDMConnect instance of Solidatus here (free registration for EDMConnect, but many corporates are already members):
https://edmc.solidatus.com/viewer/685bfbd05c54812d123d1dc8