2c5cf37b2c
Add appendix text
2025-10-08 20:30:57 +02:00
06ee312c80
Fix monthwise package accumulation
...
Cut off double-counted months from year.
2025-10-08 20:30:39 +02:00
9e3726402d
Add kernel text
2025-10-08 16:03:27 +02:00
9687eb662b
Finish package stats section
2025-10-08 15:09:08 +02:00
707632fb7d
Add daily filesize text
2025-10-08 09:27:32 +02:00
cfc8ecc4fd
Start transferring to quarto file
2025-10-07 21:56:54 +02:00
c51970507d
Add jupyter caching to quarto
2025-10-07 21:09:16 +02:00
c30238d4ed
Add tentative outline to qmd
2025-10-07 17:59:45 +02:00
a5ca2a241f
Add relative package ownership plot
2025-10-07 17:59:45 +02:00
43bd80e30e
Add quarto output dir
2025-10-07 17:59:44 +02:00
d8d2d1b16c
Add first plot to quarto article
2025-10-07 17:59:44 +02:00
cccc2f6bd4
Add jupyter as dev dependency
...
We technically only need it to 'develop' the article. Not sure if it
should go in another optional group or just the main dependencies - but
e.g. marimo notebook analyses can be run without, and it's big.
2025-10-07 17:59:43 +02:00
f10e442435
Add quarto project
2025-10-07 17:59:42 +02:00
95b3829d04
[DATALAD] Update subdataset
2025-10-07 17:59:01 +02:00
443c4c98cd
Major performance improvements for weekly and monthly plots
...
Doing most of the aggregation in polars with the streaming engine
prevents memory overload (as compared to letting ggplot do it).
2025-10-01 18:04:55 +02:00
728ec37bda
Modify popcorn notebook to run with new data structure
2025-10-01 15:44:53 +02:00
9628d7a4d8
Merge branch 'legacymain'
2025-10-01 14:42:57 +02:00
66b0464809
Remove reduntant cleaning files
2025-10-01 14:42:07 +02:00
0618814c49
Make limits for quick iteration toggleable
2025-10-01 14:35:00 +02:00
63ae4a1982
Get data from parquet files
2025-10-01 14:34:59 +02:00
07bd122aaf
Change to streaming engine for lazy operations
2025-10-01 14:34:59 +02:00
e393768d30
Update wordings and lazy collects
2025-10-01 14:34:58 +02:00
9d64e93486
Add package install count distribution
2025-10-01 14:34:57 +02:00
ad50b19631
Rename weekly package installations plot
2025-10-01 14:34:57 +02:00
094aa34758
Overhaul text cells
2025-10-01 14:34:56 +02:00
08737e1baa
Add in-text kernel usage dates
2025-10-01 14:34:56 +02:00
ce38024569
Add kernel analysis plots
2025-10-01 14:34:55 +02:00
4d5aa73de7
Add kernel cleaning func
2025-10-01 14:34:52 +02:00
9513e6544e
Use new csv for unique installs
2025-10-01 14:33:03 +02:00
e00217f2fe
Extract cleaning date checking
2025-10-01 14:33:02 +02:00
4c9518cf67
Adapt functions to new csv data layout
2025-10-01 14:33:01 +02:00
91d64f428c
Add data cleaning script
2025-10-01 14:33:01 +02:00
45920009d6
Add additional text cells
2025-10-01 14:33:00 +02:00
bc6c9d1002
Improve package updates barplot with least updated
2025-10-01 14:33:00 +02:00
43d2bac7ad
Add top updated packages barplot
2025-10-01 14:32:59 +02:00
4984289f69
Provide raw lazyframe
2025-10-01 14:32:59 +02:00
499a819c90
Add unique daily stat uploads lineplot
2025-10-01 14:32:58 +02:00
efa08b4b54
Add monthwise downloads barplot
2025-10-01 14:32:58 +02:00
86b3659f0f
Turn main dataframe into lazyframe
2025-10-01 14:32:57 +02:00
07c45ca205
Add weekday barplot
2025-10-01 14:32:56 +02:00
343b75c9e4
Remove unused functions
2025-10-01 14:32:56 +02:00
ed7ed620d4
Extract magic row limit variable
2025-10-01 14:32:55 +02:00
83dfce1a61
Change to weekly downloads
2025-10-01 14:32:55 +02:00
8fc84287ef
Add daily downloads graph
2025-10-01 14:32:54 +02:00
761cd9b2ad
Add watchdog dev dependency
...
For the --watch option in marimo
2025-10-01 14:32:54 +02:00
62e3cedb03
Add descriptive file size text
2025-10-01 14:32:53 +02:00
cbda8dca55
Parse JSON Package downloads
2025-10-01 14:32:52 +02:00
97e4d256bb
Add mod time checking
2025-10-01 14:32:52 +02:00
42fd0e5a24
Set up linters for data science
2025-10-01 14:32:51 +02:00
1701605939
Add basic size analysis
2025-10-01 14:32:51 +02:00