Situation
- nabledge-6 skill exists with knowledge about Nablarch 6 batch processing
- No validation exists for real-world code generation capability
- proman is a concrete use case with design docs and development rules available
- Validation scope: design docs + proman rules → production code → passing unit tests
Pain
Developers face uncertainty when:
- Evaluating if nabledge-6 can generate working production code
- Understanding success rate and quality of generated code
- Identifying what works and what doesn't in practice with real-world constraints (no web access)
Benefit
Developers can:
- Understand nabledge-6's practical code generation capability through documented test results
- Identify strengths and weaknesses through analysis of 5 documented attempts
- Make informed decisions about using nabledge-6 for real development based on evaluation results
Success Criteria
Test Conditions:
- Input: Design docs + proman development rules only
- Tool: nabledge-6 skill only (no web access)
- Goal: Implement production code until unit tests pass
- Iterations: 5 attempts
- Deliverables: Work logs, result analysis, evaluation report