.. | ||
analysis.ipynb | ||
bck_estimates.csv | ||
data_enrichment.ipynb | ||
evaluate_llm_time_estimations.ipynb | ||
legacy.ipynb | ||
loss.py | ||
onet_explorer_app.py | ||
README.md | ||
schema.txt | ||
tasks_estimateable.csv | ||
tasks_with_estimates.csv |
Presentation
Notebooks
- data enrichment - contains the code to gather things from the O*NET data, BLS's OEWS database (unused for now), Barnett's data...
- prompt evaluation - the playground used to evaluate change in hyperparameters (system prompt, user prompt, schema, model...)
- analysis - code to generate the graphs in the paper
- legacy - if there are some missing pieces, it's worth looking in there.
Running the non-notebook code
To re-run everything, you need python and uv up and running, if you use have nix installed, run
nix develop .#impure
and then uv run ...
as requested in the notebooks.
If some things are missing, email dorn@xfe.li, I'm usually reactive.
Copy .env.example
to .env
and fill in OPENAI_API_KEY. The total run and experiments cost less than <10$.