Apps Building Process

Prompt Engineering Key Learnings Technical Details App Analysis Model Evaluation

Build Sprint Context

These notes were captured during a main build sprint when we were hitting model rate limits. At that time we settled on Kimi K2 as the production pick, and the final entry in this set lands in mid-October 2024.

Project Context

This case study documents the work done as a Learning Engineer at Playlab to help build system prompts and optimize AI-powered educational applications deployed within high schools in Ghana. The project focused on creating curriculum-aligned assessment and lesson planning tools for Ghana's Senior High School (SHS) system.

Cost

Optimization

Migrating from expensive models to cost-effective alternatives while maintaining quality.

Prompt

Engineering

Refining system prompts with categorization and explicit formatting rules.

Model

Evaluation

Comprehensive testing of multiple AI models for production fitness.

Timeline Summary

October 1, 2024

Initial Cost Analysis

Identified that apps are too pricey. Intervention Courses - Teacher Planning app is the most used, currently using Claude Sonnet 4.5. Decision made to keep Sonnet 4.5 for this critical app given importance, or potentially move to cheaper model.

October 2, 2024

Rate Limits and Cost Concerns

Sonnet models hitting rate limits and high costs. Goal: get as many apps as possible to Kimi K2 given best cheap tool calling ability. Started reviewing Sonnet conversations and testing same conversations with Kimi.

October 3, 2024

Doubling Down on Kimi K2 Migration

Costs became unbearable. Doubling down on moving all apps to Kimi K2 and improving tool calling. Also addressing issue with Claude Sonnet constantly providing assessments with question B being the main answer.

View full timeline