Methodology
A transparent, reproducible methodology
Every number in Connect51 traces back to open data and a documented method. No black boxes.
Built on OpenAlex
Connect51 is built on OpenAlex, the open catalogue of the global research record — over 46 million works, with their authors, institutions, funders, and citations. Because the source data is open, every figure we show can be traced back to its origin.
Field-Weighted Citation Impact (FWCI)
Raw citation counts favour large, established fields. FWCI normalises citations against the world average for the same field, year, and document type, so a value of 1.0 means world-average impact and 2.0 means twice the world average. This lets you compare a mathematician and a microbiologist on a level footing.
The h5-index (h-5)
The h5-index is the h-index measured over the last five complete years: an entity has an h5-index of h when h of its publications from that window have each been cited at least h times. It rewards recent, sustained output — consistent, well-cited work — rather than a handful of older, highly cited papers.
Standing
Standing is our composite measure of an institution’s pull in a field — combining output, citation impact, and the strength of its collaboration network. It answers a question rankings can’t: where does this institution genuinely lead?
A documented subject taxonomy
We map every work to a custom Division and Subject taxonomy so benchmarking is consistent and comparable across institutions. The full taxonomy and every calculation are documented — reproducible by anyone with access to OpenAlex.
Sustainable Development Goals (SDGs)
We tag every work against the United Nations’ 17 Sustainable Development Goals using OpenAlex’s open SDG classifier. It is a multilabel model: each goal is scored independently from 0 to 1, so a single paper can map to several goals — or to none.
We only count a goal when its score is 0.4 or higher. Below that threshold the false-positive rate climbs steeply — the model starts tagging anything that mentions “water” with clean-water goals, or “city” with sustainable-cities — which is noise rather than signal. Because SDG tags feed institutional rankings and sustainability reporting, we favour precision over recall: a missed tag is less damaging than a spurious one. The 0.4 cutoff is deliberately conservative and empirically supported — independent studies that reviewed score distributions with expert input have landed on the same value.
Why transparency matters
Research leaders make high-stakes decisions on this evidence. That is why our methodology is public: so you can trust the numbers, and defend them.
See the methodology in action
Book a demo and we'll walk through exactly how your numbers are calculated.