Skip to main navigation Skip to search Skip to main content

A method for detecting abnormal users with fake stars

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In GitHub, users star interesting repositories, and the number of stars is viewed as the significant measure of repository popularity. Some repositories obtain fake stars by unjustified means, which ruin efforts that communities have made stars a valuable indicator, and bring negative impacts in GitHub. Therefore, it is important to stop abusing GitHub stars and detect abnormal users who provide fake stars. In this paper, we first define features from the user dimension and repository dimension. Then we perform differential analysis and find that most of the features show a significant difference between abnormal users and normal users. Next, we propose a method AUDetec for Abnormal User Detection. The method AUDetec uses the decision tree to detect the abnormal users based on two features, including the sum of repositories starred by the user and the median value of the number of days since creation for repositories starred by the user. We evaluate the effectiveness of AUDetec on the data set which contains 120 abnormal users and 240 normal users. The experiment results show that AUDetec has a high performance by achieving an accuracy of 99.86% on average.

Original languageEnglish
Title of host publicationDMSVIVA 2022 - Proceedings of the 28th International DMS Conference on Visualization and Visual Languages
PublisherKnowledge Systems Institute Graduate School, KSI Research Inc.
Pages63-68
Number of pages6
ISBN (Electronic)1891706551, 9781891706554
DOIs
StatePublished - 2022
Event28th International DMS Conference on Visualization and Visual Languages, DMSVIVA 2022 - Pittsburgh, United States
Duration: 29 Jun 202230 Jun 2022

Publication series

NameDMSVIVA 2022 - Proceedings of the 28th International DMS Conference on Visualization and Visual Languages

Conference

Conference28th International DMS Conference on Visualization and Visual Languages, DMSVIVA 2022
Country/TerritoryUnited States
CityPittsburgh
Period29/06/2230/06/22

Keywords

  • Abnormal user detection
  • Fake star
  • GitHub
  • Open source software
  • Repository popularity

Fingerprint

Dive into the research topics of 'A method for detecting abnormal users with fake stars'. Together they form a unique fingerprint.

Cite this