Commit graph

249 commits

Author SHA1 Message Date
38099c3358
feat(code): Output data as csv to stdout on script run
When running data python script (`src/data.py`) directly through the command line,
we now use pandas to output the collected data directly as a csv to stdout.
Can then be redirected to e.g. a file to save the data in csv format.
2023-12-14 18:06:39 +01:00
73b96a757a
feat(data): Extract Militaru2019 2023-12-14 17:42:41 +01:00
b594233d01
feat(data): Extract Davies2022 2023-12-14 16:33:21 +01:00
c3369e83e7
chore(script): Change methodology and inequalities headlines 2023-12-13 21:15:00 +01:00
2fc730812f
feat(data): Extract Khan2021 2023-12-13 21:14:31 +01:00
124e520835
chore(data): Sort out full-text studies 2023-12-13 20:48:28 +01:00
f63cf44f74
feat(data): Extract Standing2015 2023-12-13 20:02:04 +01:00
30b649f31f
chore(script): De-indent intervention headlines
Until we have thematic areas, de-indent the individual headlines for the time being.
2023-12-13 15:46:21 +01:00
84f1d64492
chore(script): Update TODOs and FIXMEs in script 2023-12-13 15:04:19 +01:00
97f6214932
chore(script): Refactor synthesis toward interventions
Refactor to start breakdown from interventions and then move towards
individual categories and inequality breakdowns within the sections.

It should follow intervention -> inequality/region logic.
2023-12-13 14:44:08 +01:00
8c2c83cd9a
chore(script): Include superseded in duplicate count
Included studies marked as 'out::superseded' in the duplication removal step.
2023-12-13 14:42:34 +01:00
c35b205345
chore(data): Update library and country separation 2023-12-13 14:40:16 +01:00
9789936712
feat(data): Extract Xu2021 2023-12-13 14:39:37 +01:00
cf4c39a3c8
feat(notes): Add simple measure documentation
From documents in DB, requires more attention and revision.
2023-12-13 13:50:41 +01:00
6e4f19ac3f
!fix(data): Fix measures collected
Fixed the measures and directions collected to use Gini, Atkinson, .. measures or
absolute employment, poverty, etc.
2023-12-13 13:50:40 +01:00
50409bb9d3
fix(data): Change Debowicz2014 to simulation design 2023-12-12 21:37:38 +01:00
d8a018b7b0
feat(notes): Add meeting notes December 2023-12-12 20:12:20 +01:00
7db335a479
feat(data): Extract Cieplinski2021 2023-12-12 20:08:02 +01:00
874e708112
chore(data): Sort out irrelevant study 2023-12-12 20:07:45 +01:00
26afa3b8ea
fix(script): Add missing Rosen2014 annotation 2023-12-12 08:48:25 +01:00
30d059d68b
chore(script): Imply regional/findings breakdowns 2023-12-12 08:46:59 +01:00
9855256b00
feat(notes): Add intermittent findings and progress
Created small up-to-date quick glance document for findings and
data set.
2023-12-11 17:17:43 +01:00
b5e467e016
feat(code): Add examples of list handling notebook
Extracts interventions/inequalities and explodes them for value counts.
2023-12-11 17:17:42 +01:00
85497854c1
feat(data): Extract Adams2015 2023-12-11 17:17:42 +01:00
6041d00d8b
chore(data): Categorize further studies for tags
Tag studies for income, gender, racial and ethnicity inequalities
based on title keywords.
2023-12-11 17:10:31 +01:00
6104f2d274
feat(data): Extract Rosen2014 2023-12-11 09:49:46 +01:00
b09e65476b
fix(data): Unify spelling Emigh2018 intervention 2023-12-10 19:49:31 +01:00
ef06e4bb88
chore(data): Sort out Dumas2018 2023-12-10 19:29:47 +01:00
af2df5736c
chore(script): Refactor pandas data ingestion
Load data at top of file, then use chained methods for
visualizations.
2023-12-10 19:29:29 +01:00
0e29a3332c
feat(data): Extract Emigh2018 2023-12-10 19:29:28 +01:00
ca7eab92d3
chore(script): Refactor pandas data ingestion
Load data at top of file, then use chained methods for
visualizations.
2023-12-10 18:00:27 +01:00
1ba2daeacd
feat(data): Extract Emigh2018 2023-12-10 17:59:19 +01:00
50ce8b6310
feat(data): Extract Shepherd-Banigan2021 2023-12-10 17:59:17 +01:00
85c6340b5c
fix(data): Remove 'poverty' as inequality category
Poverty is a status but not directly inequality
2023-12-10 17:57:50 +01:00
7258915ef1
feat(script): Extract Gates2000 2023-12-10 11:33:56 +01:00
efd6e285c8
feat(script): Extract Hardoy2015 2023-12-10 11:04:19 +01:00
8e7f99b20d
chore(script): Refactor dataframe loading code
Improved readability of dataframe loading, used improved chaining
and some list comprehension to make it much less messy.
2023-12-09 23:46:21 +01:00
3f05283f6d
chore(script): Refactor screening flowchart calculations
Made it much clearer and simpler how numbers are calculated for the
screening flowchart. Now we just keep the actual numbers in memory
and not a copy of the whole bibtex library for each calculation step.

Also renamed the bibtex variables to be more sane, `bib_sample_raw_db`
(the raw, unaltered sample returned from querying the databases), and
`bib_sample` for our working sample including database queries and
snowballing studies but already deduplicated (since we can't keep
an unduplicated version on Zotero).
2023-12-09 22:06:05 +01:00
708fa90d29
chore(script): Refactor file top imports 2023-12-09 21:54:26 +01:00
c3b0d8cfd6
fix(script): Fix display of fuller-width figures 2023-12-09 20:41:29 +01:00
a681a5b6ff
fix(data): Fix display of table data linebreak 2023-12-09 20:40:56 +01:00
51132d818d
chore(repo): Remove margin captions from html
Removed captions from figures and references in the margin and display
in the normal text-flow instead.
2023-12-09 19:05:10 +01:00
5d17d5091f
feat(script): Add introductory hook to income inequality
Added small hook which can be built upon later, introduces concept of primacy of
vertical income inequality but how it can exacerbate issues.
2023-12-09 19:04:26 +01:00
2cf2a03d87
fix(script): Remove visualization per literature type
For the time being removed the distinction between gray and white literature
as our sample did not lend itself to the dissection. May reintroduce later.
2023-12-09 18:58:27 +01:00
56e18b1285
fix(script): Calculate country groups for sample 2023-12-09 18:56:42 +01:00
3b6ed7a79b
feat(code): Add quick data querying notebook
Added a notebook which has the sole point of quickly allowing me
to grab and look at the data of the processed sample I am creating.

I.e. quickly list and uniq all interventions/outcomes/inequalities,
doing a tiny calculation or similar.
2023-12-09 18:50:43 +01:00
e2c629ff33
fix(data): Improve yml template whitespace preparation 2023-12-09 18:49:00 +01:00
b83f39ffc1
feat(data): Extract Wong2019 2023-12-09 18:48:59 +01:00
8bef9033f8
feat(data): Extract Alinaghi2020 2023-12-09 18:48:39 +01:00
9c8b42d3bc
feat(data): Extract Field2019 2023-12-09 17:00:05 +01:00