When AI Tools Work Best (And When They Absolutely Don't)
A practical guide to knowing when to lean on AI coding tools and when to step away. Based on 50+ projects where I've seen both spectacular wins and expensive failures.
KEY TAKEAWAYS
- •AI tools work brilliantly for: UI generation, CRUD operations, standard patterns, prototyping, and boilerplate code
- •AI tools consistently fail at: novel business logic, security-critical code, multi-system integrations, performance optimization, and domain expertise
- •The sweet spot is using AI for the 80% that is standard patterns and human judgment for the 20% that is unique to your business
- •Projects that stay within standard patterns can be mostly AI-built. Projects with custom logic need human oversight
- •The most expensive projects are the ones that should have involved human judgment from the start but did not
After two years of shipping products with AI coding tools, I can predict with reasonable accuracy which projects will go smoothly and which will hit walls.
It is not about the tool. It is about the project type. Some kinds of work play to AI strengths. Others run directly into AI limitations.
Here is the breakdown.
Where AI Tools Shine
UI Generation
This is the single best use case for AI coding tools. Describe a screen, get a working implementation with components, layout, and styling. The AI has seen millions of UI patterns in training data and generates reasonable implementations quickly.
REAL EXAMPLE
RiskPod — Agencies Quoted £130K+. Actual Cost: £40K.
Challenge:
Recruitment consultancy founder approached three agencies to build a compliance contractor marketplace. All quoted £130K+ with 6-month timelines.
Solution:
Built the full platform — AI-powered onboarding, document verification, intelligent matching, admin dashboard — in 30 days for £40K.
Results:
- •67% less than agency quotes
- •550+ signups in 48 hours
- •£10K/month ongoing retainer
Landing pages, dashboards, forms, settings screens, profile pages — anything where the challenge is mostly visual design and layout. AI tools do this exceptionally well.
Standard CRUD Operations
Create, read, update, delete. The bread and butter of most applications. User submits form. Data saves to database. List view shows all records. Detail view shows one record. Edit form modifies it.
This pattern is so common in training data that AI tools generate it reliably. If your application is mostly CRUD with a nice UI, AI tools can handle most of the work.
Boilerplate Code
REAL EXAMPLE
PulseIQ — Agency Estimated 12 Months. Built in 30 Days.
Challenge:
An optometry practice needed a comprehensive operations platform. The agency estimate was 12 months of calendar time.
Solution:
Built a multi-tenant SaaS platform with 8 modules including AI-driven insights and staff development tools in 30 days.
Results:
- •30 days vs 12 months
- •8 feature modules
- •Fraction of agency cost
Project scaffolding, configuration files, API route structures, database schema definitions, authentication flows with standard patterns. The boring, repetitive setup work that every project needs.
AI tools are tireless at generating boilerplate. What would take hours of copy-paste and adaptation takes minutes.
Rapid Prototyping
When you need to test an idea quickly — something you can click through and show to potential users — AI tools are unmatched. Speed matters more than quality at this stage, and AI delivers speed.
Learning Standard Patterns
If you are trying to understand how a technology works, asking AI tools to generate examples with explanations is an excellent learning method. The code quality might not be production-grade, but for learning, that does not matter.
Where AI Tools Struggle
Novel Business Logic
Every business has rules that are unique to that business. How pricing is calculated. What happens when a subscription downgrades. Which users can see which data under which conditions. The specific workflows that make your product different from generic alternatives.
AI tools can not reason about business logic they have never seen. They guess based on similar patterns, and those guesses are often wrong in ways that only become apparent after launch.
Security-Critical Code
Authentication, authorization, data protection, input validation, encryption. Anything where getting it wrong means a security incident.
AI-generated security code looks correct but often is not. It uses the right function names and follows recognizable patterns, but misses critical details: tokens stored insecurely, sessions that do not expire, validation that can be bypassed.
Multi-System Integrations
Connecting your application to external APIs, handling webhooks, managing OAuth flows, synchronizing data between systems. The complexity explodes because you are now dealing with two systems that were not designed to work together.
AI tools struggle here because the edge cases are specific to the combination of systems, which is unlikely to be well-represented in training data.
Performance Optimization
Making code fast requires understanding why it is slow. AI tools can generate code that works, but generating code that works efficiently under load requires reasoning about database query patterns, memory usage, caching strategies, and concurrent request handling.
These are areas where AI-generated code consistently underperforms human-optimized code.
Domain Expertise
Healthcare compliance, financial regulations, legal requirements, industry-specific workflows. Any area where getting it wrong has consequences beyond a bug fix.
AI tools do not have domain expertise. They have patterns from training data, which may or may not reflect current regulations or industry best practices.
The Project Type Test
Based on these patterns, here is a quick test for whether your project is AI-friendly:
Mostly AI-buildable: Your product is essentially a database with a nice UI. Standard user flows. No complex business logic. No payments or simple one-time payments. No regulatory requirements. Examples: portfolio site, blog, simple dashboard, basic e-commerce.
AI-assisted but human-supervised: Your product has some custom business logic but it follows recognizable patterns. Subscription payments with standard tiers. User roles with straightforward access rules. Basic integrations with common services. Examples: SaaS with subscriptions, marketplace MVP, project management tool.
Human-led with AI acceleration: Your product has novel business logic, complex integrations, or regulatory requirements. Multi-tenant data with strict isolation. Payment flows with edge cases. Industry-specific compliance. Examples: fintech applications, healthcare platforms, multi-sided marketplaces.
The Sweet Spot
The best results come from using AI tools for what they do well and human judgment for what they do not.
Use AI for: UI generation, boilerplate, standard patterns, rapid iteration, and the 80 percent of code that looks like code the AI has seen before.
Use humans for: architecture decisions, security review, business logic validation, performance optimization, and the 20 percent of code that is unique to your business.
This is not about AI versus humans. It is about the right tool for the right task. And right now, AI tools are excellent assistants but unreliable decision-makers.
The founders who succeed are the ones who understand what AI tools are actually good at and use them accordingly.
---
Related reading
- [What AI Tools Still Need a Human For](https://hellocrossman.com/resources/blog/what-ai-tools-still-need-human-for)
- [Cursor vs Replit vs Bolt vs Lovable](https://hellocrossman.com/resources/blog/cursor-vs-replit-vs-bolt-vs-lovable)
- [Specifications for AI Coding Tools](https://hellocrossman.com/resources/blog/specifications-ai-coding-tools-prompts)
- [Agentic Engineering Explained](https://hellocrossman.com/resources/blog/agentic-engineering-karpathy-explained)
Sources
Tom Wild
Founder & Product Leader
Founder of HelloCrossman, helping startups and scale-ups ship products faster with AI-accelerated development. Passionate about turning ideas into reality in 30 days or less.
Building in Public
Follow along as I build tools, ship products, and share what actually works.
No spam. Unsubscribe anytime.