Someone sent me a link to Matador Network’s 36 MAPS THAT WILL MAKE YOU SEE THE WORLD IN COMPLETELY NEW WAYS. They are all very interesting and I have selected three that are relevant to things I have covered here in the past.
Navies of the Pacific
European Twitter Usage By Language
A picture has been worth a thousand words for quite some time, but social media usage is rewiring our brains, particularly for the millennials. We don’t read so much any more as we scan, evaluate for relevance, and the maybe dig deeper. Maps and infographics not only express complex information, they provide visual anchors for readers who might wish to later return for more information.
Take the time to look through the maps I didn’t feature here, it’s a truly beautiful collection.
The Central African Republic’s troubles were obvious to foreign policy watchers a year ago and now conflict has boiled over into outright ethnic cleansing. As a former French colony the best sources on this area are still in French and as a resource poor, land locked area there simply wasn’t much available in February of last year. Now that things have gotten really bad some new maps have emerged.
Central African Republic Tribal Map
Central African Republic Languages
Central African Republic Conflict Map
And these are the original Central African Republic maps collected on February 3rd, 2013.
I would normally provide some sort of unifying commentary for a collection of maps. All I have to offer today is that someone commented on the lack of quality maps for the Central African Republic, and I decided to go digging.
I first noticed the Global Database of Events, Language, and Tone on New Years day, which led me to write Foreign Policy’s Global Conversation Infograhic. Three weeks later the service was mysteriously suspended and five days later we got an explanation as to what happened.
Since the successful resolution of concerns regarding data and processes used by GDELT they have made dramatic progress. GDELT Analysis Service offers fourteen different visualizations and export of the associated data. I entered the keyword ‘Ukraine’, selected all data for 2014, and about a minute later I received an email with this link to a live heatmap and another link to a CSV file.
Ukraine Mentions During 2014
Next I used the timeline tool. I was hoping for a live page like the location heat map, but it just produces a static image and a CSV file.
Each GDELT event contains a date, geographic coordinates, the players involved, and the type of interaction. The content is regular enough that it could be mapped to an import process for Sentinel Visualizer, an intelligence sector link analysis tool. I had set out to do something similar with the Global Terrorism Database last summer, but there were license issues, and the dataset lacks the live feed feature that GDELT provides.
GDELT is clearly going to grow services based on the live stream of content they have available. These services are likely to play to the strengths they have in defining and operating the stream. There should be plenty of room for follow on qualitative analysis and integration of data sources external to their feed. Data sources such as spot commodity prices in locations trending towards trouble would be particularly helpful in spotting hazards before they get out of hand.
That was a long, uncomfortable silence, after I posted GDELT’s Mysterious Demise, but we now have the particulars on what happened:
The bottom line is that GDELT is one of the very few event datasets in existence today that actually has all of the necessary permissions. The concerns that have recently been discussed were raised by two faculty members at the University of Illinois and were examined by a panel of faculty experts convened by the University of Illinois’ Vice Chancellor for Research. That panel formally cleared GDELT on behalf of that office stating “the Panel finds that it was not able to conclude that GDELT is founded on misappropriated … data or software.” With respect to concerns raised regarding the open source TABARI software that GDELT makes use of to create its CAMEO event records, the same panel explored concerns raised regarding its ownership and similarly found that “TABARI … has well known antecedents at another institution dating back to at least 2000 and therefore is not attributable to the [University of Illinois]“. While this whole situation would have been easily avoided with just a little communication and avoided a lot of unnecessary angst, the silver lining is that it has demonstrated just how widely-used and important GDELT has really become over the past year and we are tremendously excited to work with all of you in 2014 to really explore the future of “big data” study of human society.
I thought there might be a problem with either the underlying data or the software used, turns out that both issues were raised by the University of Illinois professors who parted ways with the project.
This feels a bit like the USL vs. BSDi lawsuit, which freed unix from AT&T’s clutches twenty years ago. A big, important datasource is now out in the open in such a way that it can not be put back. I have some financial records digging to do in the coming week, the Montgomery County Council will remain a priority until the primary is over, but I am itching to wrestle the GDELT feed into some format I can personally use.
When working with grassroots analysts free software and services are key. I have long wished for a project that would fund a copy of Sentinel Visualizer I could keep, but the $5,000 cost for being able to handle temporal and geospatial data is very steep.
I have been inspecting campaign finance data for the Montgomery County Council and the files are clean enough that all of it could be geocoded, the only question was what tool to use. Earlier today noticed Sourcemap in a logistics discussion group on LinkedIn and this just might be the solution we’ve been seeking. Here is an example map from the free service, showing the global sources of Nutella ingredients.
If you sign up for the service it provides you a way to do free, public mappings using a simple spreadsheet format for input.
We can treat financial contributions as raw materials, county council members as ‘legislation factories’, and someone would have to know enough to describe which developer goes with which specific project, then we’d have a finished map of influence to development.
I can do the data handling component, what I need are some local hands and eyes who know more of the companies, personalities, and history behind the urbanization of Montgomery County.
I wrote Foreign Policy’s Global Conversation Infographic on New Year’s Day. The content used to create the visualization was based on the Global Data of Events, Language and Tone, commonly referred to as GDELT. The effort was suspended during the week ending January 17th via this terse announcement.
There was an addendum to this which I didn’t include int he screen capture, but it mentioned Robin Kaler at the University of Illinois. I wrote her seeking additional information on the suspension and I received a response just moments ago.
“serious questions about the origins of the source texts used to code GDELT”
I believe this means that whomever created the CAMEO coding was either not credited appropriately, or there may be an issue with using it in a derivative product. I ran into this late last year – I was going to republish the Global Terrorism Database packaged for use with Sentinel Visualizer, but this was not allowed. I was free to publish a set of scripts to accomplish this task, the issue was that the entities that fund that effort wanted a count of total users, so any derivative work had to be post processing run by the user, rather than repackaging.
I hope what we are seeing here is some sort of pause to clean house and/or make things right with regards to whatever coding material was incorrectly used. The volume and quality of content was extremely promising and I hope the suspension is just some misunderstanding that can be quickly corrected. I kept the archive of the 1979 – 2012 data so I can continue working on something that will handle the live feed when it returns.
As a response to Visualizing Graph Databases With Linkurious I received a tip that I should look at Muckety. This system provides social/network visualization with data from a variety of important sources.
I have looked at The Militarist Galaxy as documented by RightWeb in the past so I choose one of the more noxious individuals, Frank Gaffney, as a starting point. I was pleased and surprised to see the Mother Jones articles on Groundswell were integrated.
Muckety Graph: Frank Gaffney
Gaffney shares the Groundswell link with Allen West, a former Congressman and a war criminal who is equally willing to have journalists abused in this country. I dug just a little further than this and my eyes opened wide at the connections which were revealed.
Muckety: Frank Gaffney & Alen West
A tool like Muckety doesn’t replace a desktop link analysis setup like Maltego, Gephi, or Sentinel Visualizer. What it does do in an easily accessible fashion is permit people who might otherwise never handle such technology to enter the name of an individual or organization that interests them, and immediately see important connections which would otherwise take hours of Googling, reading, and note taking.
Network analysis is to 2014 what social media was to 2009 – something that has specialists using it, but which will rapidly spread due to the powerful sense-making capabilities it offers when trying to understand complex interactions.