logo
episode-header-image
Nov 2024
42m 24s

Github Collaboration Network

Kyle Polich
About this episode

In this episode we discuss the GitHub Collaboration Network with Behnaz Moradi-Jamei, assistant professor at James Madison University.  As a network scientist, Behnaz created and analyzed a network of about 700,000 contributors to Github's repository.  The network of collaborators on GitHub was created by identifying developers (nodes) and linking them with edges based on shared contributions to the same repositories. This means that if two developers contributed to the same project, an edge (connection) was formed between them, representing a collaborative relationship network consisting of 32 million such connections.
By using algorithms for Community Detection, Behnaz's analysis reveals insights into how developer communities form, function, and evolve, that can be used as guidance for OSS community managers.

Up next
Dec 26
Video Recommendations in Industry
In this episode, Kyle Polich sits down with Cory Zechmann, a content curator working in streaming television with 16 years of experience running the music blog "Silence Nogood." They explore the intersection of human curation and machine learning in content discovery, discussing ... Show More
38m 16s
Dec 18
Eye Tracking in Recommender Systems
In this episode, Santiago de Leon takes us deep into the world of eye tracking and its revolutionary applications in recommender systems. As a researcher at the Kempelin Institute and Brno University, Santiago explains the mechanics of eye tracking technology—how it captures gaze ... Show More
52m 8s
Dec 8
Cracking the Cold Start Problem
In this episode of Data Skeptic, we dive deep into the technical foundations of building modern recommender systems. Unlike traditional machine learning classification problems where you can simply apply XGBoost to tabular data, recommender systems require sophisticated hybrid ap ... Show More
39m 57s
Recommended Episodes
Jun 2022
Network Analyzer with Zach Seils and Manasa Chalasani
tail spinning
38m 50s
Sep 2024
Stack Overflow Signs Deal with OpenAI to Sell User Data
<p>In this episode, we explore the recent partnership between Stack Overflow and OpenAI, detailing how Stack Overflow's vast repository of developer insights and coding solutions will be utilized to enhance OpenAI's models. We'll dive into the implications of this collaboration f ... Show More
6m 12s
Oct 15
Inside the Linux Foundation's Open-Source Movement
Daniela Barbosa, General Manager of Decentralized Technologies at the Linux Foundation, and Executive Director at LF Decentralized Trust, discusses the most promising open-source projects they've supported so far, and how more builders can get involved. She also emphasizes the im ... Show More
24m 34s
Sep 2017
TBP156 - Combined Forces for Better Results
The Sweetbridge Foundation, a non-profit aiming to leverage blockchain technology to power the next generation of global supply chain networks, announced that blockchain expert Vinay Gupta joined its Advisory Group. Drawing upon his decades of experience in the cryptocurrency, te ... Show More
1h 31m
Nov 2024
scikit-learn & data science you own
We are at GenAI saturation, so let’s talk about scikit-learn, a long time favorite for data scientists building classifiers, time series analyzers, dimensionality reducers, and more! Scikit-learn is deployed across industry and driving a significant portion of the “AI” that is ac ... Show More
52m 2s
Nov 2024
Building an AI creator community w/ Civitai founders Justin Maier and Maxfield Hulker
Ever since generative AI tools like Midjourney became available to the public in 2022, curious users and AI fanatics alike have been experimenting with the technology. But for tech aficionados and AI enthusiasts like Justin Maier and Maxfield Hulker, Midjourney’s closed-source mo ... Show More
49m 45s