Data Dictionary

Savvy investors may consider developer activity a predictive indicator of the success or failure of a technology company. Sherlock data includes a growing list of 100+ tickers.

Available Data Tables

 Five comprehensive datasets covering developer activity and project health
PROJECT CDI
Community Development Index score measuring developer community engagement
WEEKLY FILE TYPE CHANGES
Detailed file modification tracking by type and category showing development focus areas
CODE CHANGE AGG
Aggregated developer activity including commits, issues, pull requests with daily/weekly/monthly views
DEVELOPER AGG
Developer experience cohorts tracking engagement patterns and community growth over time
PROJECT REPOSITORIES
Repository metadata, social signals, blockchain categorization and project classification

Sherlock Community Development Index (CDI) Weights

Learn more about CDI at: https://chaoss.community/?p=4455

Contributor Count: 19.987%
Commit Frequency: 16.363%
Is Maintained: 13.853%
Commit to Pull Request Ratio: 12.612%
Pull Request to Issue Ratio: 11.319%
Pull Request Review Ratio: 10.113%
Pull Request Merge by Others Ratio: 10.113%
Lines of Code Frequency: 5.640%
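
As a rough illustration of how the weights above could combine into a single score, the sketch below takes a weighted average of the eight components. It assumes each component has already been normalized to a value between 0 and 1 and that the result is scaled to the 0-5 range this dictionary gives for CDI. The normalization step, the scaling, the function name `cdi_score`, and the dictionary keys are all illustrative assumptions, not Sherlock's published method.

```python
# Weights as listed above, in percent. They sum to 100.
CDI_WEIGHTS = {
    "contributor_count": 19.987,
    "commit_frequency": 16.363,
    "is_maintained": 13.853,
    "commit_to_pull_request_ratio": 12.612,
    "pull_request_to_issue_ratio": 11.319,
    "pull_request_review_ratio": 10.113,
    "pull_request_merge_by_others_ratio": 10.113,
    "lines_of_code_frequency": 5.640,
}


def cdi_score(normalized, scale=5.0):
    """Weighted average of component scores, scaled to [0, scale].

    ASSUMPTION: every value in `normalized` is already in [0, 1].
    How Sherlock normalizes each component is not stated in the source.
    """
    missing = set(CDI_WEIGHTS) - set(normalized)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    total_weight = sum(CDI_WEIGHTS.values())
    weighted = sum(CDI_WEIGHTS[name] * normalized[name] for name in CDI_WEIGHTS)
    return scale * weighted / total_weight
```

Dividing by the total weight rather than by a hard-coded 100 keeps the result within range even if the weights are later revised.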

CDI & Core Metrics

CDI
The predictive measure that investors may use to identify technology companies with strong developer communities
CODE CONTRIBUTOR COUNT
Active pull request creators, code reviewers, and commit authors in the past 90 days
COMMIT FREQUENCY
Average number of commits per week over the past 90 days
IS MAINTAINED
Percentage of code repositories with at least one commit in the last 90 days
CODE REVIEW RATIO
Percentage of code commits in the last 90 days with at least one reviewer other than the pull request creator
LINES OF CODE FREQUENCY
Average lines touched (added plus removed) per week in the past 90 days
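
The 90-day definitions above can be sketched from raw commit records. The example below is a minimal illustration, assuming a hypothetical record shape (a dict with `repo`, `day`, `added`, and `removed`) and assuming the 90-day window is treated as 90/7 weeks when averaging; neither detail is specified in the source.

```python
from datetime import date, timedelta

WINDOW_DAYS = 90
# ASSUMPTION: "per week over the past 90 days" divides by 90/7 weeks.
WEEKS_IN_WINDOW = WINDOW_DAYS / 7


def in_window(day, as_of):
    """True if `day` falls within the 90 days ending on `as_of`."""
    return as_of - timedelta(days=WINDOW_DAYS) < day <= as_of


def core_metrics(commits, repos, as_of):
    """Compute three of the 90-day metrics for one project.

    commits: list of dicts with 'repo', 'day', 'added', 'removed'
             (hypothetical schema, for illustration only).
    repos:   names of all repositories belonging to the project.
    """
    recent = [c for c in commits if in_window(c["day"], as_of)]
    active_repos = {c["repo"] for c in recent} & set(repos)
    lines_touched = sum(c["added"] + c["removed"] for c in recent)
    maintained_pct = 100.0 * len(active_repos) / len(repos) if repos else 0.0
    return {
        "commit_frequency": len(recent) / WEEKS_IN_WINDOW,
        "is_maintained": maintained_pct,
        "lines_of_code_frequency": lines_touched / WEEKS_IN_WINDOW,
    }
```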

Developer Analytics

TOTAL DEVS CT
Count of unique developers that have made at least 1 code commit
DISTINCT DEVS COUNT 1 MONTH
Developers that actively committed code in exactly 1 distinct month
DISTINCT DEVS COUNT 2 12 MONTH
Developers that actively committed code in 2 to 12 distinct months
 
CDI
Community Development Index score measuring developer community dedication (0-5 range)
DISTINCT DEVS COUNT 13 PLUS MONTH
Developers that actively committed code in more than 12 distinct months
DEVELOPER COUNT
Count of unique developers that have made at least 1 code commit
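
The experience cohorts above bucket each developer by the number of distinct calendar months in which they committed code. The sketch below shows one way that bucketing could work, assuming the input is a list of (developer, commit date) pairs; the input shape and the function name `developer_cohorts` are hypothetical.

```python
from collections import defaultdict
from datetime import date


def developer_cohorts(commits):
    """Bucket developers by number of distinct months with a commit.

    commits: iterable of (developer, commit_date) pairs
             (hypothetical input shape, for illustration only).
    """
    months_by_dev = defaultdict(set)
    for developer, day in commits:
        months_by_dev[developer].add((day.year, day.month))

    cohorts = {
        "total_devs_ct": len(months_by_dev),
        "distinct_devs_count_1_month": 0,
        "distinct_devs_count_2_12_month": 0,
        "distinct_devs_count_13_plus_month": 0,
    }
    for active_months in months_by_dev.values():
        n = len(active_months)
        if n == 1:
            cohorts["distinct_devs_count_1_month"] += 1
        elif n <= 12:
            cohorts["distinct_devs_count_2_12_month"] += 1
        else:
            cohorts["distinct_devs_count_13_plus_month"] += 1
    return cohorts


# Two commits in the same month count as one distinct month.
example = developer_cohorts([
    ("alice", date(2024, 1, 5)),
    ("alice", date(2024, 1, 20)),
    ("bob", date(2024, 1, 3)),
    ("bob", date(2024, 2, 9)),
])
```

Because every developer lands in exactly one bucket, the three cohort counts always add up to the total developer count.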

Pull Request & Issue Metrics

PULL REQUEST CREATED
Count of requests to merge code changes into the main branch
PULL REQUEST MERGED
Count of pull requests merged into the main branch
ISSUES OPENED
Count of suggested improvements, tasks or questions related to the project
ISSUES CLOSED
Count of issues closed by developers within a project
COMMIT PULL REQUEST LINKED RATIO
Percentage of new code commits that link to pull requests in the last 90 days
PULL REQUEST ISSUE LINKED RATIO
Percentage of new pull requests that link to issues in the last 90 days
CODE MERGE RATIO
Percentage of pull requests in the last 90 days where the person who merged the pull request is not its author
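
To make the pull request ratios concrete, the sketch below derives several of them from a list of pull request records. The record fields (`author`, `merged_by`, `linked_issues`) are a hypothetical schema, the input is assumed to be already limited to the last 90 days, and using merged pull requests as the denominator for CODE MERGE RATIO is an assumption the source does not confirm.

```python
def percentage(part, whole):
    """Return part/whole as a percentage, or 0.0 when whole is zero."""
    return 100.0 * part / whole if whole else 0.0


def pull_request_ratios(pull_requests):
    """Summarize pull request activity.

    pull_requests: list of dicts with
        'author'        - login of the pull request creator
        'merged_by'     - login of the merger, or None if not merged
        'linked_issues' - list of linked issue numbers (may be empty)
    (hypothetical schema; assumed pre-filtered to the last 90 days)
    """
    merged = [pr for pr in pull_requests if pr["merged_by"] is not None]
    merged_by_others = [pr for pr in merged if pr["merged_by"] != pr["author"]]
    issue_linked = [pr for pr in pull_requests if pr["linked_issues"]]
    return {
        "pull_request_created": len(pull_requests),
        "pull_request_merged": len(merged),
        "pull_request_issue_linked_ratio": percentage(
            len(issue_linked), len(pull_requests)
        ),
        # ASSUMPTION: denominator is merged pull requests only.
        "code_merge_ratio": percentage(len(merged_by_others), len(merged)),
    }
```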

Repository & Social Metrics

FORK COUNT
Count of times a user has duplicated the selected repository
STARGAZERS
Count of users that have starred the selected repository
WATCHERS
Count of users currently watching the selected repository, which enables notifications when the repository is updated
COMMIT COUNT
Count of changes to a code base and its associated files within the selected repo
ORG REPO URL
The URL of the project repository
DESCRIPTION
The description of the project written by the organization administrator

Project Classification

PROJECT
The colloquial name of the open source project
PROJECT CATEGORY
The company sector or category
LOGIN
The name of the project organization
IS CORE
Binary flag identifying if the repo is part of the primary core organization

File & Change Tracking

FILE TYPE CATEGORY
The file type category, summarizing the general purpose of the file
FILE TYPE
Format and nature of files changed in repository, based on file extension
FILE CHANGES
Count of files added, deleted, or modified within the selected file type and time frame
REPO
Centralized file storage location for code files, revisions, and documentation
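
Since FILE TYPE is derived from the file extension and FILE CHANGES counts files per type, the relationship can be sketched as a simple tally. The extension-to-category mapping below is entirely hypothetical; the actual categories Sherlock uses are not listed in this dictionary.

```python
from collections import Counter
from pathlib import PurePosixPath

# HYPOTHETICAL mapping, for illustration only.
CATEGORY_BY_EXTENSION = {
    ".py": "code",
    ".rs": "code",
    ".md": "documentation",
    ".yml": "configuration",
    ".json": "configuration",
}


def file_changes_by_type(changed_paths):
    """Count changed files per (file type category, file type).

    changed_paths: paths of files added, deleted, or modified
                   in the chosen time frame.
    """
    counts = Counter()
    for path in changed_paths:
        # Files without an extension (e.g. "Makefile") get a placeholder type.
        extension = PurePosixPath(path).suffix.lower() or "(none)"
        category = CATEGORY_BY_EXTENSION.get(extension, "other")
        counts[(category, extension)] += 1
    return counts
```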

Temporal & Configuration

DT
Update date for developer cohorts
PROJECT START DATE
Date when first commit was made under this project
PROJECT DEVELOPMENT START DT
The date that the first repository of the project was created
FIRST COMMIT DT
The date of the first code commit within the selected repository
START DATE
Date metric calculation began

Sherlock Data Coverage

 

Technology

Media

Travel

Fintech

Communications

AI

Blockchain

Explore the data on Snowflake

Invest in leading technology teams

Sherlock Data helps quantitative and fundamental portfolio managers incorporate open-source software development trends into their investment strategy.