No description

Find a file

Félix Dorn 720f21a85b wip		2025-04-28 11:02:28 +02:00
.env.example	progress	2025-04-26 23:38:19 +02:00
.envrc	progress	2025-04-26 23:38:19 +02:00
.gitignore	wip	2025-04-28 11:02:28 +02:00
add_task_estimates_to_samples.py	wip	2025-04-28 11:02:28 +02:00
analysis.ipynb	wip	2025-04-28 11:02:28 +02:00
create_onet_database.sh	progress	2025-04-26 23:38:19 +02:00
data_enrichment.ipynb	wip	2025-04-28 11:02:28 +02:00
enrich_task_ratings.py	wip	2025-04-28 11:02:28 +02:00
evaluate_llm_time_estimations.ipynb	wip	2025-04-28 11:02:28 +02:00
flake.lock	progress	2025-04-26 23:38:19 +02:00
flake.nix	init	2025-04-26 13:32:29 +02:00
legacy.ipynb	wip	2025-04-28 11:02:28 +02:00
pyproject.toml	wip	2025-04-28 11:02:28 +02:00
README.md	wip	2025-04-28 11:02:28 +02:00
uv.lock	wip	2025-04-28 11:02:28 +02:00

Presentation

Notebooks

data enrichment - contains the code to gather things from the O*NET data, BLS's OEWS database (unused for now), Barnett's data...
prompt evaluation - the playground used to evaluate change in hyperparameters (system prompt, user prompt, schema, model...)
analysis - code to generate the graphs in the paper
legacy - if there are some missing pieces, it's worth looking in there.

To re-run everything, you need python and uv up and running, if you use have nix installed, run

nix develop .#impure

and then uv run ... as requested in the notebooks.

If some things are missing, email dorn@xfe.li, I'm usually reactive.

Copy .env.example to .env and fill in OPENAI_API_KEY. The total run and experiments cost less than <10$.