agentic-studio

Author	SHA1	Message	Date
jaakko	7d49d62f81	CodeBench: kompaktoitu Go handlers.go golden — error handling yhdelle riville	2026-04-14 21:15:57 +03:00
jaakko	8fc31f2a53	CodeBench: kierroskohtainen output-dir + tiivistetty Go golden example - runPipeline saa round-parametrin, dir: model__scenario__r1, __r2 jne. - todo-go.md testit 6→4 (poistettu list+update toisteiset), 466→370 riviä	2026-04-14 20:57:50 +03:00
jaakko	f3cd1347ab	CodeBench: Go-tuki — Chi + SQLite + httptest - Golden example: todo-go/ (6/6 testit läpi) - todo-go.md golden reference - prompts/code-go.md koodigenerointi-prompti - Dockerfile.go-test (golang:1.23-alpine) - benchmark.mjs: LANG_CONFIG, parseTestOutput, prompt/golden-valinta Go:lle - Käyttö: node benchmark.mjs --lang go --models qwen2.5-coder:32b	2026-04-14 19:20:18 +03:00
jaakko	0975385101	CodeBench: reqwest 0.13 + Docker volume cache + rust:latest - reqwest 0.12 → 0.13, rustls-tls → rustls (golden, Dockerfile, promptit) - Docker volume cache: kipina-cargo-registry + kipina-cargo-target - rust:latest (1.94) + cmake (aws-lc-sys vaatii) - Dockerfile yksinkertaistettu — esikäännös ei toimi, volume hoitaa - Golden example 10/10 testattu uudella setupilla	2026-04-14 18:42:05 +03:00
jaakko	2f602717b8	CodeBench: tiivistetty todo-rs.md golden example 540→331 riviä - handlers.rs: tiiviimpi muotoilu, kommentit kuvaavat patternia - tests: 10 testiä → 4 avaintestiä (create, get, not_found, delete) - spawn_server tiivistetty - Kaikki kriittiset patternit säilyvät: RETURNING, fetch_optional, rows_affected	2026-04-14 17:50:19 +03:00
jaakko	477c21efd0	CodeBench: Rust golden example — todo-rs.md + kielitietoinen valinta - Luotu todo-rs.md golden example Rust-referenssitoteutuksesta - getGoldenForModel() huomioi nyt LANG: todo.md → todo-rs.md Rust-moodissa - Korjattu golden-compact-rs.md /:id → /{id} bugi - Juurisyy: malli sai Python golden examplen mutta piti generoida Rustia	2026-04-14 17:37:38 +03:00
jaakko	5d0baf3ff1	CodeBench: combined-readme.md — todo + blog golden example 8b:lle Molemmat esimerkit (single entity + FK relaatio) yhdessä tiedostossa. 1699 tokenia, 10.4% kontekstista. 8b näkee konkreettisen FK-patternen.	2026-04-14 14:54:12 +03:00
jaakko	a25c52cff4	CodeBench: mallikohtainen golden example (profiles.json → golden kenttä) qwen3-coder:30b → todo.md (annotaatiot) qwen3:8b → todo-readme.md (GitHub README -muoto, tutuin koulutusdata) Golden example ladataan dynaamisesti per malli pipelinen sisällä.	2026-04-14 14:04:28 +03:00
jaakko	e54c1b057c	Golden example: tarkat 6 testiä per entiteetti, ei ylimääräisiä Malli generoi test_search, test_filter yms. joita ei ole endpointeissa. Nyt todo.md listaa tarkalleen 6 testiä per entiteetti nimillä.	2026-04-14 12:56:50 +03:00
jaakko	6a40ca5730	CodeBench: golden example markdown-muodossa (koodi + selitykset) todo.md yhdistää koodin ja annotaatiot: miksi pattern on valittu, mitä EI saa tehdä. 1567 tokenia (vs raaka 1340, compact 335). Benchmark lataa .md-version oletuksena, fallback erillisiin tiedostoihin.	2026-04-14 12:38:25 +03:00
jaakko	e7b33b7d6f	CodeBench: Rust-tuki (--lang rust), golden example todo-rs, Dockerfile.cargo-test - golden-examples/todo-rs/: Axum 0.8 + SQLx + SQLite, 10 testiä - prompts/code-rs.md: Rust-koodingenerointiprompt - Dockerfile.cargo-test: rust:1.87-slim testikontti - benchmark.mjs: --lang python\|rust, kieliriippuvainen golden example, parseri tukee cargo test -tuloksia, src/ alihakemistot	2026-04-14 10:55:50 +03:00
jaakko	9da5540ca2	Golden example: todo-rs (Axum + SQLx + SQLite)	2026-04-14 10:50:16 +03:00
jaakko	7b27800390	Siirrä kipina-codebench projektin päätasolle	2026-04-14 09:44:14 +03:00

13 Commits