Drupal Public Data, Statistics & Silver Linings? An Exploration

Drupal Installation Analysis Part 1
Drupal Public Data, Statistics & Silver Linings? An Exploration

This is part 1 of a 5-part series examining Drupal public data in search of actionable insights.

Drupal Installations Analysis

The "Usage statistics for Drupal core" Page

This article analyzes publicly available data presented by the Drupal Project regarding the number of reportedly active Drupal core installations in the period December 22 - 28, 2024.  It is available at: https://www.drupal.org/project/usage/drupal 

Drupal frames the data set in the following way:

"This page provides information about the usage of the Drupal core project, including summaries across all versions and details for each release. For each week beginning on the given date the figures show the number of sites that reported they are using a given version of the project."

Drupal also qualifies the quality data set in this manner:

"These statistics are incomplete; only Drupal websites using the Update Status module are included in the data. This module has been included with the download of Drupal since version 6.x so the data does not include older sites."

A Justification for Using Statistics

Statistics is "the discipline that concerns the collection, organization, analysis, interpretation and presentation of data" (Wikipedia, 2025).  It seems therefore prudent (and perhaps even enjoyable) to attempt applying the approaches and tools of such a discipline to the data set offered by Drupal.  

Digging deeper, Statistics to be the ideal discipline to turn to in circumstances where data has been deemed uncertain, considering that "Statistics is both the science of uncertainty and the technology of extracting information from data" (Hand, 2010), which are the two overriding considerations of undertaking this work. 

Two more things make Statistics stand out as the ideal tool to help power this investigation:  Data Quality, an important sub-discipline of Statistics, was developed to help improve the quality of data sets so as to increase "the capability of data to be used effectively, economically and rapidly to inform and evaluate decisions" (Karr et al, 2002).  Also, Exploratory Data Analysis, yet another sub-discipline of Statistics and one that is frequently used in conjunction with Data Quality, was developed, according to IBM (2025), to:

"...help look at data before making any assumptions. It can help identify obvious errors, as well as better understand patterns within the data, detect outliers or anomalous events, find interesting relations among the variables."

With a defence as to the relevance and efficacy of applying Statistics to the Drupal data set finished, we can now move on to the fun stuff:  Examining and interpreting the Usage statistics for the Drupal core data set for the period December 22 - 28, 2024.

The Analysis

Here is the raw data as provided by Drupal:

Table 1:  Data as Provided by Drupal
Week ending:28-12-2024Week ending:28-12-2024
VersionInstallationsVersionInstallations
5.x1729.1.x3,249
6.x11,4759.2.x5,143
7.x265,8739.3.x11,337
8.x19.4.x13,558
8.0.x1,1909.5.x56,738
8.1.x52510.0.x7,154
8.2.x93110.1.x14,645
8.3.x1,36510.2.x42,349
8.4.x94410.3.x136,153
8.5.x3,60710.4.x26,755
8.6.x5,61510.5.x257
8.7.x6,04311.x5,226
8.8.x7,00111.0.x6,466
8.9.x31,76911.1.x8,284
9.0.x1,393Total675,218

(the only transformation applied to this data was to reformat it so it appears vertically rather than its original presentation, which was originally presented horizontally) 

(For the purpose of layout, we have moved half the data from columns A and B and continued it on C and D - Editor)

Statistic #1:  Share Percentage

A simple statistical manipulation would be to add percentages to the data under the heading "Share"

Table 2:  Data as Provided by Drupal with Share Percentage
Week ending:

28-12-2024

VersionInstallationsShare (%)VersionInstallationsShare (%)
5.x1720.03%9.1.x3,2490.48%
6.x11,4751.70%9.2.x5,1430.76%
7.x265,87339.38%9.3.x11,3371.68%
8.x10.00%9.4.x13,5582.01%
8.0.x1,1900.18%9.5.x56,7388.40%
8.1.x5250.08%10.0.x7,1541.06%
8.2.x9310.14%10.1.x14,6452.17%
8.3.x1,3650.20%10.2.x42,3496.27%
8.4.x9440.14%10.3.x136,15320.16%
8.5.x3,6070.53%10.4.x26,7553.96%
8.6.x5,6150.83%10.5.x2570.04%
8.7.x6,0430.89%11.x5,2260.77%
8.8.x7,0011.04%11.0.x6,4660.96%
8.9.x31,7694.70%11.1.x8,2841.23%
9.0.x1,3930.21%Total675,218100.00%

Statistic #2:  Edition Share Percentage

Another simple statistical manipulation would be to gather the "Editions" of Drupal together (i.e. Drupal 5.x, Drupal 6.x, Drupal 7.x, etc...), aggregating them into an "Edition share", while revising the original "Share" to read "Version Share":

Table 2:  Data as Provided by Drupal with Version & Edition Share Percentage
Week ending:

28-12-2024

VersionInstallationsVersion Share (%)Edition Share (%)VersionInstallationsVersion Share (%)Edition Share (%)
5.x1720.03%0.03%9.1.x3,2490.48% 
6.x11,4751.70%1.70%9.2.x5,1430.76% 
7.x265,87339.38%39.38%9.3.x11,3371.68% 
8.x10.00%8.74%9.4.x13,5582.01% 
8.0.x1,1900.18% 9.5.x56,7388.40% 
8.1.x5250.08% 10.0.x7,1541.06%33.67%
8.2.x9310.14% 10.1.x14,6452.17% 
8.3.x1,3650.20% 10.2.x42,3496.27% 
8.4.x9440.14% 10.3.x136,15320.16% 
8.5.x3,6070.53% 10.4.x26,7553.96% 
8.6.x5,6150.83% 10.5.x2570.04% 
8.7.x6,0430.89% 11.x5,2260.77%2.96%
8.8.x7,0011.04% 11.0.x6,4660.96% 
8.9.x31,7694.70% 11.1.x8,2841.23% 
9.0.x1,3930.21%13.54%Total675,218100.00%100.00%

Statistic #3:  Aggregated Edition Share Percentage

Another simple statistical manipulation would be to create a summary table of the different Editions featuring number of installs (#) and Edition Share (%) along with some sanity checks of derived table totals against the original raw data totals, which is a very basic application of Data Quality (DQ).

Table 3:  Aggregated Edition Share Percentage
EditionInstallations (#)Share (%)
5.x1720.03%
6.x11,4751.70%
7.x265,87339.38%
8.x58,9918.74%
9.x91,41813.54%
10.x227,31333.67%
11.x19,9762.96%
Total675,218100.00%
Raw675,218100.00%
CheckYY

EDA #1:  Edition Share Percentage as Pie Chart

Drawing on the tools and techniques of Exploratory Data Analysis (EDA), we can now use the summary table to help generate a pie chart that expresses Edition Share in a visual way:

Figure 1:  Edition Share Percentage Pie Chart
 
Edition Share Percentage Pie Chart

EDA #2:  EDA's Ability to Power Statistical Inferences

One of the "superpowers" of EDA is harnessing the visual center of the human brain, a finely tuned facility that has benefited from over 250,000 years of evolution.  

From a systems perspective, this is akin to loading a well-tested, comprehensive "library" in order to deal with an existing situation in a completely novel way.  Sometimes, the use of a new approach or perspective can render a heretofore intractable or complicated situation simple and solvable.

What can be immediately inferred from the EDA-powered data projection through a pie chart metaphor is the relative size of the "pie piece" of each Drupal edition:

  • The Edition share of Drupal 5 is almost unobservable.
  • The Edition Share of Drupal 10 and Drupal 7 are large, and roughly equal in size.
  • The Edition Share of Drupal 8 and Drupal 9 are medium, and roughly equal in size.
  • The Edition share of Drupal 11 and Drupal 6 are small, and roughly equal in size.

These observations inevitably lead back to a deeper examination of the source data because of the shape of the data.  Presumably, those Drupal Editions were released in series, with Drupal 5 being the earliest recorded Edition and Drupal 11 the most recent.

So what's going on?

The next piece in this series introduces a set of useful statistical models that could be immediately applied to the Drupal data set in the quest to derive actionable insights from their raw numbers.


In 2002, Professor Graham Leach began lecturing at Hong Kong Polytechnic University, the largest school in Hong Kong, by teaching graduate level courses in the Department of Computer Science of the School of Engineering. In 2010, he moved over to the School of Design and remained there until his retirement in 2023.   Graham currently occupies a Professorship of Entrepreneurship in the School of Business of Gratia Christian College, Hong Kong's newest Tertiary educational institution.

Image Attribution Disclaimer: At The Drop Times (TDT), we are committed to properly crediting photographers whose images appear in our content. Many of the images we use come from event organizers, interviewees, or publicly shared galleries under CC BY-SA licenses. However, some images may come from personal collections where metadata is lost, making proper attribution challenging.

Our purpose in using these images is to highlight Drupal, its events, and its contributors—not for commercial gain. If you recognize an image on our platform that is uncredited or incorrectly attributed, we encourage you to reach out to us at #thedroptimes channel on Drupal Slack.

We value the work of visual storytellers and appreciate your help in ensuring fair attribution. Thank you for supporting open-source collaboration!

Note: The vision of this web portal is to help promote news and stories around the Drupal community and promote and celebrate the people and organizations in the community. We strive to create and distribute our content based on these content policy. If you see any omission/variation on this please reach out to us at #thedroptimes channel on Drupal Slack and we will try to address the issue as best we can.

Advertisement Here
Advertisement Here

Upcoming Events

Latest Opportunities

Advertisement Here