GIS StackExchange Analysis, 2014 Edition

It’s time again to revisit my periodic look at GIS StackExchange (GISSE) and what it may or may not tell us about the state of things geospatial. By now, the process is fairly routine. I have single Python script that gets tag data and parses it to CSV. I then hand-edit categories into the data for grouping purpose. While it’s  perfectly valid to quibble with individual category assignments, I’m fairly consistent with it at this point, using previous data sets as a guide. Compared to last year, the all-time look hasn’t changed much. Open-source and “general topics” have switched places, but there were no great shifts that I could see. The roughly 4% increase in open-source topics could be a result of QGIS support moving to GISSE.

gisse_all_time_2014

As I pointed out last year, I am finding the all-time look less useful because I think the overall distribution is heavily affected by discussions in previous years. As a result, we probably won’t see much change as each year increasingly represents only a marginal addition to the overall GISSE body of work. As a result, I decided to compare two yearly snapshots. I processed the previous two 12-month periods and did not expect what I found

gisse_12mo_2013As can be seen in the chart above, the earlier 12-month period showed results that seem to be roughly in line with the all-time distribution. Esri-related discussions were roughly in line with all-time results. Open-source related discussions accounted for a much smaller, but still significant, segment of the result. General topics were by far the biggest kind of discussion during this period.

The results for the most recent 12-month period, shown in the chart below, were much more surprising.

gisse_12mo_2014As can be seen, Esri-related discussions all but disappeared during this period, with open-source and general topics being the primary beneficiaries. In this case, the raw numbers tell the tale. During the first 12-month period, the top 100 tags accounted for 6810 posts. During the second, more recent period, they accounted for 1407 posts, a decrease of 5403 posts from one 12-month period to the next. The next chart shows this side-by-side.

gisse_2013v2014With such a significant drop in traffic, there were clear decreases in all categories, with the most significant decreases being in Esri-related discussions and general topics.

I typically hesitate to interpret this data to infer any conclusions about the GIS market, and I won’t do so this time. I do think this data says something about GISSE itself. There was one significant event this past year that I think could account for the drop-off shown above: the roll-out of Esri’s GeoNet. Esri-related discussions have dominated GISSE since its inception but they virtually disappeared during the 12-month period that included the GeoNet roll-out. I have a hard time thinking of that as a coincidence.

I began investigating this theory by looking at the four months before and after the GeoNet “release” at this year’s Esri UC Plenary. The data I found was inconclusive. I seem to recall getting a GeoNet account some time before that so it is possible that early betas of the platform may have already been having an effect on GISSE. I think I will need to do a month-by-month analysis to see if I can identify when the decline started, but such an analysis will take more time than I have at the moment.

I hope I am wrong, but I sense that the Esri users that had been participating in GISSE have gone back behind the wall. If so, the community just got a little more fragmented. I rendered my opinion from the Plenary when GeoNet was announced, and I stand by it:

The usual caveats apply: Posts can have multiple tags so the raw numbers certainly include multiple counting of posts. The “top 100” tags in any given time period will vary slightly, though usually at the lower end of the scale. Category assignments are based solely on my judgment but I have tried to be consistent each year. Tags can be edited by moderators and post authors, making them somewhat susceptible to potential manipulation.