Data Scientists of 2016

A searchable and interactive network graph of Data Scientists archived in February 2016

The Twitter Network of 30,000 "Data Scientists" and their connections

The Twitter Network of 30,000 “Data Scientists” and their connections

The visualisation was created using Gephi after applying 3 steps on NodeXL. Step 1 – A Twitter search for the term ‘datascientist’ on 02/02/2016 that returned… Step 2 – 1,132 user accounts Step 3 – Input the 1,132 user accounts in to a Twitter User search that returned 30,000 user accounts. Export into Gephi with Modularity Class selected to differentiate groups and OpenOrd to set the graph.

You can search by name or select various groups that have been classed together according to modularity.

30000 Data Scientists 2016 pdf version

Data Scientist Twitter Network Visualisation 2016

@analyticsbridge Twitter Network Visualizationn 2016



The quest to find Data Scientists specifically related to Health has taken one small step forward.

For further information contact

The Forbes Billionaire List and their Connections – 2015

The Forbes 2015’s Billionaires (2015). Forbes ranks more than 1,800 billionaires and these are their company, and affilations with Government Bodies etc – This list was then adapted by LittleSis . I have then used it to construct a network which I’ve then visualised using Gephi.

Clicking on the image opens an interactive and searchable version.

imageof forbes2015

The Forbes Rich List and their Connections

It follows the same principle as those laid out here

Forbes magazine has been publishing the list of The World’s Most Powerful People since 2009. The number of people in the list is proportional to the global population with the ratio being one slot for every 100 million people on Earth. When the list started in 2009, there were 67 people on the list and the latest list from year 2014 had 72 people. According for Forbes, the list is calculated based on the person’s influence over lots of other people (e.g. Pope Francis, Wal-Mart CEO, Doug McMillon), financial resources controlled by the people (e.g. GDP, market capitalization, profits, assets, revenues and net worth), power in multiple spheres (e.g. Bill Gates), active use of power by the people (e.g. Vladimir Putin). While this list gives a snapshot of global ranking, it does not reveal information about past and present network connections and inter-linkages between people and organisations, the spread of power across the network, information about key entities who act as network intermediaries for power and which cluster of entities are most prominent in this global power structure. This is where we complement the Forbes data with LittleSis, which is a database of who-knows-who at the heights of business and government. LittleSis has information of the global rich and powerful such as past and present organizational affiliations (employment, directorships, memberships, alumni networks), donations (political contributions, grants), social connections (family ties, mentorships, friendships), professional connections (partnerships, supervisory relationships), services/contracts (legal representation, government contracts, lobbying services) etc.

For further information regarding this interactive and searchable visualisation please contact or @soci

Interactive Graph of Wikipedia: Influential Thinkers

Interactive Version

Interactive Version

Graphs of Wikipedia: Influential Thinkers –

While I work out how to extract data from Wikipedia thought I should post this here.

The bigger the node, the larger the betweenness centrality score i.e. the bigger influence that person had on the rest of the network.  These are the most influential figures in the network. However I do agree with…

This however brings us to one of the largest problems in doing work like this; the graph is intrinsically “wrong”. Brendan Griffen.

The inspiration for this graph was from Griffen – who had promised to make the interactive version of his graph available but is yet to do so.

To show just how “wrong” these graphs are – here is exactly the same data but this time a different algorithm (hub) has been used to size the nodes…

Interactive map


The map is only useful when you zoom in close to particular areas or type in a name in the search function.

Below is a close up of a red section that on closer inspection is a group of comedians…


Big Data Blogs and Websites July 2015

The source of seed blogs and websites is this list of Top Analytic Blogs and Web Sites  is analysis is based on data submitted on sign-up by 16,000 Analyticbridge members, between February 2008 and December 2011.

Interactive visualisation of a hyperlink analysis of blogs and websites associated with Big Data and analytics.

Interactive visualisation of a hyperlink analysis of blogs and websites associated with Big Data and analytics.