SPIKE: Caterpillar set-up for claude-agent #52
Mahesh Kamble (ma-gk) wants to merge 1 commit into main
Nice start Mahesh!
Thanks prasadlohakpure. I split the skills by task so the AI agent loads only the instructions relevant to the task at hand, which should improve accuracy, reduce hallucination, and save context. It also makes task-specific updates easier and more scalable. For common changes that need to apply everywhere, we can add one parent/base skill and propagate those updates through all task-specific skills. So a hybrid approach may work best: a shared parent skill for common logic, and separate skills for each task. What do you think about the parent skill approach for handling future common changes across all skills?
SPIKE: Claude Agent Set-up for Caterpillar
Summary
Adds a comprehensive Claude Code agent configuration to the Caterpillar data pipeline tool. This introduces AI-assisted pipeline authoring, validation, debugging, and optimization through a structured set of agents, skills, rules, hooks, and commands — enabling developers to build, review, and ship pipelines faster with Claude as a copilot.
What's Included
47 new files across the `.claude/` directory and a top-level `CLAUDE.md` project guide.

`CLAUDE.md`: Project Context File

`.claude/agents/`: 9 Specialized Sub-Agents
- `pipeline-builder-interactive`
- `pipeline-lint`
- `pipeline-validate`
- `pipeline-permissions`
- `pipeline-optimizer`
- `pipeline-review`
- `pipeline-debugger`
- `pipeline-runner`
- `source-schema-detector`

`.claude/skills/`: 22 Task-Specific Skills
One skill per Caterpillar task type (`file`, `kafka`, `sqs`, `http`, `jq`, `split`, `join`, `replace`, `flatten`, `xpath`, `converter`, `compress`, `archive`, `sample`, `delay`, `echo`, `sns`, `http-server`, `aws-parameter-store`, `heimdall`) plus two meta-skills:
- `pipeline-builder`: schema reference for direct YAML generation
- `pipeline-tester`: generates step-by-step test plans with probe pipelines

`.claude/rules/`: 3 Authoring Rules
- `pipeline-authoring.md`: conventions for task naming, ordering, and structure
- `pipeline-security.md`: credential handling, secret management guardrails
- `pipeline-testing.md`: testing standards and probe pipeline patterns

`.claude/commands/`: 7 Diagnostic Commands
Quick-check commands for infrastructure connectivity: `check-aws`, `check-http`, `check-kafka`, `check-s3`, `check-sns`, `check-sqs`, `check-ssm`

`.claude/hooks/`: 3 Lifecycle Hooks
- `preflight-check.sh`: pre-tool-use validation before shell commands
- `validate-on-save.sh`: auto-validates pipeline YAML on file write/edit
- `run-summary.sh`: post-execution summary after shell commands

`.claude/scripts/` & `.claude/settings.json`
- `run-pipeline.sh`: helper script to build and run pipelines
- `settings.json`: permission allow/deny lists and hook configuration

Motivation
Caterpillar pipelines have a rich task ecosystem (18 types) with complex configuration options (auth, context keys, JQ transforms, AWS integrations). This set-up gives Claude the domain knowledge and guardrails to:
Test Plan
- `CLAUDE.md` is picked up as project context in Claude Code sessions
- `pipeline-review` agent runs the full lint → validate → permissions → optimize sequence
- `validate-on-save` triggers on `.yaml` writes, `preflight-check` runs before shell commands
- Deny list blocks destructive commands (`git push --force`, `aws s3 rm`, etc.)
- Diagnostic commands (`check-aws`, `check-sqs`, etc.) report connectivity status accurately
- `pipeline-runner` agent runs against example pipelines in `test/pipelines/examples/`
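
To make the hook and deny-list checks in the test plan easier to reason about, here is a minimal sketch of the two decisions involved: whether a shell command should be blocked, and whether a written file should be validated. It is not the content of the PR's `preflight-check.sh` or `validate-on-save.sh`. The function names `is_denied` and `should_validate` are hypothetical, and the blocked prefixes are only the two commands named in the test plan.

```shell
#!/bin/sh
# Hypothetical sketch only; the real hook scripts in this PR may work differently.

# Returns 0 (true) when the command starts with one of the destructive
# prefixes named in the test plan, 1 otherwise.
is_denied() {
  case "$1" in
    "git push --force"*|"aws s3 rm"*) return 0 ;;
    *) return 1 ;;
  esac
}

# Returns 0 (true) when the path ends in .yaml, the extension the test plan
# says validate-on-save reacts to; 1 for anything else.
should_validate() {
  case "$1" in
    *.yaml) return 0 ;;
    *) return 1 ;;
  esac
}
```

A hook script could call these helpers and exit non-zero when `is_denied` matches. Because the match is on a prefix, `git push --force-with-lease` is also caught, which may or may not be the intended behaviour.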