CrazyBoyM 8f4a130371 ci: add GitHub Actions test workflow with real agent tests
Tests:
- test_bash_echo: Run simple bash command
- test_file_creation: Create and verify file
- test_directory_listing: List directory contents
- test_file_search: Search with grep
- test_multi_step_task: Multi-step file manipulation

Each test runs a complete agent loop (API call -> tool execution -> continue).

Required secrets:
- TEST_API_KEY: API key for testing
- TEST_BASE_URL: API base URL
- TEST_MODEL: Model (default: claude-3-5-sonnet-20241022)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-24 23:58:04 +08:00
..