Data Intelligence,
Engineered.

AI-powered platforms that turn decades of industrial data into decisive operational intelligence.

Aberdeen Est. 2021 Asset-Intensive Industries
Explore Work Start a Conversation
Scroll
Approach

Industrial data science, end to end. From legacy archives to live operations, we turn raw data into AI systems, analytics and intelligence the people who run your assets can actually use.

What We Do

Six disciplines.
One delivery.

End-to-end intelligence systems, from raw legacy documents to production-grade applications that engineers actually use.

01 / AI & Reasoning

LLMs & RAG

Retrieval-Augmented Generation pipelines grounded in your own inspection records, drawings and operational documents. Production LLM integration, vector databases, document intelligence.

LLMRAGVector DBNLP
02 / Engineering

Full-Stack Build

Architecting and shipping the application layer — combining LLM reasoning, vector search and industrial data into platforms designed for the workflows of integrity engineers and asset managers.

PythonC#AWSDocker
03 / Data

ETL & Digitisation

Turning legacy inspection reports, RBI studies and unstructured operational documentation into clean, queryable, analysis-ready datasets — at scale, with AWS Lambda, Textract and bespoke pipelines.

LambdaTextractDynamoDBETL
04 / Vision

Computer Vision

Automated interpretation of technical drawings, isometrics and P&IDs — CNN and Siamese network architectures that classify, extract and digitise the assets buried in your archive.

CNNTensorFlowKerasOCR
05 / Optimisation

Constraint Solving

Decision systems that schedule under real-world constraints — crew skills, bed limits, materials, dependencies, area access. Solver-backed plans that maximise value, re-optimise on disruption, and explain themselves to the people holding the budget.

CP-SATOR-ToolsSchedulingPinned Re-Opt
06 / Forecasting

Predictive Models

Time-series projection and self-correcting models calibrated against measured ground truth. Earned-value forecasting, demand projection, equipment-life prediction — built with two-model designs that sidestep the well-known fragility of single-equation forecasts.

Time-SeriesCalibrationEarned ValueETC / EAC
For Clients

Built for the field.

A selection of platforms designed and delivered to clients across asset-intensive industries — oil & gas, renewables, industrial operations. Each shipped as a working application that engineers use in their daily work.

QUERY · FIG.01
30D15D5D1D BASELINE 8–24× FASTER
Platform 01Cycle Time
QUERY

Equipment-centric intelligence.

An AI-powered platform that reorients industrial data around the equipment itself — letting engineers interrogate decades of inspection records, RBI studies and operational documentation in natural language, with traceable, source-linked answers.

8–24× Time savings on asset-data gathering for integrity reviews & life-extension studies
CONTEXT · FIG.02
INSPECTIONS DRAWINGS RBI STUDIES QUERY
Platform 02Embedding Space
Context

The institutional memory layer.

A vector database and Retrieval-Augmented Generation platform purpose-built for industrial document intelligence. Search, synthesise and reason over the technical archive — drawings, reports, specifications — with citations engineers can trust.

Vector Embeddings + grounded LLM reasoning over technical document corpora
EXTRACT · FIG.03
INSPECTION REPORTS 7,420 TECHNICAL DRAWINGS 5,180 RBI STUDIES 3,890 PIPING ISOMETRICS 2,055 TOTAL DIGITISED 18,545
Platform 03Throughput
Extract

Legacy archives, digitised at scale.

Modular ETL software that ingests unstructured legacy documentation — scanned reports, technical drawings, multi-decade inspection histories — and outputs structured, analysis-ready data. Deployed across multiple client engagements.

Multi-Op Delivered across major North Sea operators & industrial clients
SMART DRAWING · FIG.04
DETECTIONS3 / 3 CONFIDENCE0.97 CLASSPIPE-04 FLANGE F-204 VALVE V-118 ELBOW E-097 LINE NO.P-1024-A REV.03 PARSED2025
Platform 04Vision
Smart Drawing

Reading the technical archive.

Automated interpretation of piping isometrics, P&IDs and technical drawings using convolutional and Siamese neural networks — converting hand-drawn and scanned engineering documents into machine-readable, queryable assets.

CNN Drawing classification, isometric extraction & anomaly interrogation
CAMPAIGN OPTIMISER · FIG.05
D1D3D5 D7D9D11 NOW WI-001WI-002WI-003 WI-004WI-005WI-006 CPI1.04 SPI0.98 POB28/32 VALUE PLANNED£4.2M
Platform 05Optimisation
Campaign Optimiser

A constraint solver for offshore campaigns.

Full-stack platform replacing spreadsheet-based campaign planning with a CP-SAT constraint optimiser — maximising included work-item value subject to crew skills, POB bed limits, materials and dependencies. Break-in impact analysis pins the existing schedule so users see only the genuine knock-on, not solver churn.

CP-SAT Solver-backed scheduling · multi-tenant · plan versioning & rollback
Personal Practice

Built for myself.

The personal lab — where the same engineering discipline gets applied to a problem I actually own. A test bed for the local-first patterns that show up in client work.

VITALS · FIG.01
92KG88KG 84KG80KG WK 0WK 4 WK 8WK 12 CALIBRATED MODEL NAÏVE 7700 KCAL/KG Δ 4KG · 12 WEEKS
Personal BuildSelf-Hosted PWA
Vitals

A health platform that runs on hardware you own.

Household-scale, multi-user health and fitness tracking — built on local infrastructure. Smart-scale ingestion, on-device OCR + LLM nutrition parsing (no third-party calls), Apple Watch GPX rendering on Leaflet, and a self-correcting weight prediction model. PWA-installable, deployable from a PC to a Raspberry Pi.

React · TS · ViteFastAPI · asyncPostgreSQLTesseract OCROllama (Qwen)Docker · ARM64
Anatomy of an Application

What's inside a build.

The components that typically ship in a Data Rich application — an intake card, an acceleration curve, a risk surface, a throughput meter, a grounded answer, an anomaly view, and an attribution layer. Figures below are illustrative.

Discipline   Industrial Data Science
Form Factor   Engineered Application
Delivery   Web · PWA · API
Fig. 01 — What You GetDelivery

A Data Rich application typically ships in three layers — a data substrate, the intelligence services that reason over it, and the application surface engineers actually use.

SubstratePostgres · Vector · Object Store
IntelligenceLLM · RAG · CV · Solver
SurfaceWeb · PWA · Auth · Dashboards
Fig. 02 — Time to InsightAcceleration
BASELINE · 30 DAYS / REVIEW 30D22D14D 6D1D WK 0WK 1WK 2 WK 3WK 4WK 5 WK 6 1 DAY / REVIEW
Fig. 03 — Risk SurfaceOperational View
LOW HIGH
Fig. 04 — ThroughputIngestion
18,545
Total · 4 Formats
Fig. 05 — Grounded Q&ARAG · Cited

"Show me the corrosion history of caisson C-4 over the last decade — and flag anything outside the inspection envelope."

12 documents4 inspections2 anomalies0.97 confidence

Caisson C-4 exhibits localised pitting in the splash zone, first noted INSP-2019-04. Subsequent ROV surveys (2021, 2023) confirm no propagation beyond the original wear pad. One anomaly — wall-thickness reading below tolerance — flagged in RBI-2024-08; investigate before next campaign.

INSP-2019-04ROV-2021-07RBI-2024-08DWG-C4-REV5+8 more
Fig. 06 — Anomaly Surfacen = 1,284
NOMINAL3 OUTLIERS
Fig. 07 — Source AttributionPer Response
0.97
Source-Cited Responses
Oil & Gas Subsea Infrastructure Renewables Pipeline Integrity Industrial Operations Asset Management Oil & Gas Subsea Infrastructure Renewables Pipeline Integrity Industrial Operations Asset Management
By the Numbers

A measured record.

Peak time-saving multiplier on integrity reviews
0+
Years building AI for asset-intensive industries
1st
Class Honours BSc Data Science · Robert Gordon University
Academic class prizes during graduate apprenticeship
Richard Thomson
RT
Richard Thomson Lead Data Scientist · Founder
Founder

A decade in industrial data.

Lead Data Scientist with a First Class Honours degree in Data Science and over five years building AI-powered data solutions for asset-intensive industries — from document controller at Lockheed Martin and information management at Spirit Energy, to designing and delivering the QUERY platform.

Specialist in LLM integration, vector databases, full-stack application development and the digitisation of decades-old industrial archives. Track record of managing technical delivery and client engagement across oil & gas, renewables and industrial operations.

— Data, distilled.

Get in Touch

Let's build something that lasts.

If your organisation is sitting on decades of unstructured technical data — and the people who understand it are retiring — we should talk.

richard@data-rich.co.uk