I was drowning in context-switching across 5+ business systems, missing follow-ups, and spending hours pulling data that should have taken seconds. So I built an autonomous AI that monitors my work environment, surfaces what matters, and handles the operational overhead that used to eat my day. It runs 24/7 on production infrastructure, not as a demo.
Vizro dashboard showing live KPIs, P&L, pipeline, and bank balances Dummy Data
I run finance and operations at a tech company. My day is split across Teams, SharePoint, Xero, HubSpot, and email. Someone asks a question in one system, the answer lives in another, and the context for why it matters is in a third. I was spending hours just pulling data, missing follow-ups because nobody tracked who said what, and scrambling for context before meetings.
"What does the Customer X contract say about payment terms?" That's a SharePoint search. "Who hasn't paid us?" That's Xero. "What deals is the sales team working on?" HubSpot. "Tell me about Customer X." That needs all three. The information always existed. It was just scattered across 6,000+ files, multiple systems, and years of accumulated documents.
Someone asks a question in Teams. The pipeline triages it, figures out whether it needs documents, live accounting data, CRM data, or all three at once, and returns an answer as an Adaptive Card with approve/edit/reject buttons. Live data questions hit Xero or HubSpot directly, zero LLM cost. Document questions run a hybrid search (BM25 + FAISS + reranker). Cross-source queries like "tell me about Customer X" pull from everything simultaneously. A PII blocker catches sensitive data before anything goes out.
BM25 for keywords, FAISS with E5-Large-Instruct for semantics, Reciprocal Rank Fusion merges rankings, then a ROSES AI reranker picks the top five. HyDE generates domain-specific terms so vague queries like "anything about the Customer X deal" still find the right documents. 100% accuracy on production tests. The search pipeline is frozen. No changes without explicit approval.
"Who hasn't paid us?" pulls live Xero receivables. "What deals is David working on?" queries HubSpot pipeline data. "Find the employment contract template" searches SharePoint. Ask "tell me about Customer X" and it pulls from all three simultaneously. 17 direct API actions, and when a query routes to live data the LLM never touches it. Just formatted numbers, straight back.
Built on McKinsey's Vizro framework. Covers the full picture: P&L, cash position, sales pipeline, receivables, CRM activity, engagement trends. 40+ chart types. There's an AI chat overlay too, so you can ask Claude to recut any chart without leaving the page.
10+ scheduled jobs that run without any manual trigger. Morning and evening briefings land in Teams with calendar, email highlights, financial alerts, and follow-up reminders. Deal stage monitoring checks HubSpot every 5 minutes during business hours. An invoice scanner picks up PDFs from my inbox, uses Vision AI to extract line items, matches against Xero records, and flags duplicates or price drift. A commitment tracker detects promises in Teams and email, then alerts when they're overdue. Pre-meeting briefs generate 15 minutes before each call with attendee CRM history and open invoices.
Every evening, it writes a first-person diary entry from the day's activity: Teams conversations, emails, calendar, deal changes, financial updates. A separate behavioural feedback system analyses my communication patterns and flags blind spots like unreplied messages, tone issues, or neglected relationships. Rolling memory tracks per-person interaction metrics and recurring feedback themes. Weekly digest every Friday with trends and self-review evidence.
It remembers what you were talking about. Per-user context with a 2-hour window and 10-turn memory. Ask about Customer X, then follow up with "what about their payment terms?" and it knows what you mean. Every answer comes back with options and nothing goes out without you approving it. It also learns which style of answer you tend to pick from your approval history.
Ask a question in Teams, pick an answer, click approve. Nothing gets sent without you seeing it first.
Same multi-agent architecture, applied to markets. Four agents with deterministic kill-switches and risk veto.
View case study
Different stack, same problem-first approach. TypeScript/React Chrome extension for WhatsApp productivity.
View case study
Where I first learned that the best problems sit between departments. Family textile business, digital from scratch.
View case studyNobody could find anything in 7,000 files. Every question meant someone stopping their day to dig through folders.