I'm working on my presentation for the Coq bug minimizer, and figured some of you might be amused by this plot I made: image.png
hm, looks like bug reports were going down after a peak in 2018, but then 2020 broke the trend. I guess there could be many explanations for that
If you want to have statistically interesting results, you should remove Jason's reports from the data. Otherwise, you have too much that depends on the circumstances of a single individual. In my "Impact of switching bug trackers" paper, Jason's data was removed (as well as mine, but for different reasons).
but what really is a "typical" Coq issue reporter? It seems most issues are filed by a small number of people, with individual circumstances playing a big part
Here's bug reports by person:
image.png
And here's what you get if you exclude me:
image.png
Median # bug reports is 1
Mean # bug reports is 9.2 if you include me, 7.2 if you don't
If you exclude everyone with just 1 bug report, the median is 4; mean is 19.8 if you include me, 15.3 if you don't
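For anyone who wants to redo these numbers outside Excel, here is a minimal sketch of the median/mean computation with one reporter left out or with single-report people filtered out. The function name `summarize`, its parameters, and the toy data are all made up for illustration; this is not the script used for the plots, and the numbers are not the real Coq data.

```python
from statistics import mean, median


def summarize(reports_per_person, exclude=(), min_reports=1):
    """Return (median, mean) of per-person bug report counts.

    reports_per_person: dict mapping a reporter name to a report count.
    exclude: names to leave out (e.g. one very prolific reporter).
    min_reports: drop everyone with fewer reports than this.
    Raises statistics.StatisticsError if nobody is left after filtering.
    """
    counts = [
        n
        for person, n in reports_per_person.items()
        if person not in exclude and n >= min_reports
    ]
    return median(counts), mean(counts)


# Hypothetical toy data, NOT the real Coq numbers.
toy = {"top_reporter": 40, "b": 4, "c": 1, "d": 1, "e": 1}

print(summarize(toy))                            # everyone
print(summarize(toy, exclude={"top_reporter"}))  # without the top reporter
print(summarize(toy, min_reports=2))             # only people with 2+ reports
```

Even on the toy data, the median barely moves when the top reporter is dropped, while the mean changes a lot, which is the same pattern as in the numbers above.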
OK, so I guess one might want some cluster/percentile analysis. What share of all reports do the top-X (X = 10?) reporters account for?
where did all the non-jim people who were on the first image go?
oh the x labels don't include all the points
that's confusing
Yeah, I think Excel is bad at bar plots with 984 bins...
ah, I guess I missed the opportunity for appearing in that chart by mostly working on ecosystem side...
But more seriously, I think the best approach these days, instead of jumping to create an issue, is to start talking about the problem on Zulip or similar, and then report at the request of the devs (who can often tell if it's novel or not)
at least for regular Coq users
ah, so if I'm reading it right, top 10 reporters reported about 40% of all issues
intuitively, this doesn't seem so bad; I guess in small projects the top 10 are likely to account for > 90%
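The top-X share discussed here can be sketched in a few lines. The function name `top_share` and the toy data are hypothetical and only illustrate the calculation; the roughly 40% figure above comes from the real data, not from this snippet.

```python
def top_share(reports_per_person, x=10):
    """Fraction of all reports filed by the x most prolific reporters.

    reports_per_person: dict mapping a reporter name to a report count.
    Returns 0.0 if there are no reports at all.
    """
    counts = sorted(reports_per_person.values(), reverse=True)
    total = sum(counts)
    if total == 0:
        return 0.0
    # Slicing past the end is fine: with fewer than x reporters, all are counted.
    return sum(counts[:x]) / total


# Hypothetical toy data, NOT the real Coq numbers.
toy = {"top_reporter": 40, "b": 4, "c": 1, "d": 1, "e": 1}

for x in (1, 2, 10):
    print(f"top {x}: {top_share(toy, x):.0%} of all reports")
```

Running it for a range of X values gives a cumulative curve, which makes it easier to compare projects of different sizes than a single top-10 number.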
In case anyone else is interested in playing with the data in excel: Coq-Bug-Report-Plots.xlsx
And the scripts I used to scrape the data are in https://github.com/JasonGross/coq-bug-minimizer-paper/tree/main/presentation
If you want to see it broken out by years:
image.png
Last updated: Sep 15 2024 at 13:02 UTC