From raw data to reliablemarket insights
Developed over nearly a decade, our proprietary webAI technology turns open-source data into verified, decision-grade intelligence.
Science-based 360°Open-Source Intelligence
Market intelligence is only as strong as the data behind it. Traditional providers rely on standardized databases that are rarely updated and built on rigid industry codes and predefined filters. ISTARI takes a fundamentally different approach: Instead of pulling data from existing databases, we generate it specifically for each client, on demand from all open web sources.
Built on a decade of research into web-based market analysis
ISTARI's webAI technology is built on over a decade of foundational research in web-based market intelligence, led by PhD-level founders who helped shape the field. Unlike conventional providers that rely on standard AI models, ISTARI develops its own scientifically validated methodology. This approach is supported by more than 25 peer-reviewed publications and continuously validated through collaborations with leading international research partners.
The webAI method: Precision data through 100% OSINT
WebAI is based exclusively on Open Source Intelligence (OSINT). We use systematic analysis of publicly accessible web data — without gray markets, opaque sources, or compliance risks. This allows us to generate a customized, highly current data foundation for each of your specific questions.

The limits of general-purpose chatbots in strategicmarket intelligence
With the rise of powerful large language models, many organizations consider using chatbots for strategic market analysis. The limitation lies in their underlying architecture. While general-purpose models are designed to generate broad and versatile outputs, they face clear constraints when applied to specialized, high-stakes market intelligence, constraints that webAI is built to overcome.
01
Grounding
The problem of outdated training data
Standard models answer queries primarily from their training data, so essentially from "memory." As a result, the information generated this way can be outdated or plausibly fabricated ("hallucination").
WebAI always generates responses through live analysis. We validate facts in real time against official registries and current web data, and back every data point with transparent source references.
The Global Organization Index (GOI)
Our proprietary organization-level dataset and the foundation of everything we do. Comprising approximately 20 million active, verified organizations across 232 countries and territories, it is one of the most rigorously curated global indexes of its kind, built on validated and deduplicated data.

400M
Registry Collection
Systematic querying of national registries worldwide, enriched with open data sources.
40M
Domain Attribution
Attribution to identifiable web domains. Inactive or non-operational organizations are filtered out.
20M
Activity Verification
Verification of actively operated domains. Only active organizations remain in the final dataset.
Top countries by coverage
232
Countries and territories
Top industries (NACE)
22
Industry sectors (NACE)
Distribution of organizations by size
42.8%
Micro
0–9 employees
43.7%
Small
10–49 employees
8.8%
Medium
50–249 employees
4.8%
Large
250+ employees