tool-evaluation
$
npx mdskill add elophanto/EloPhanto/tool-evaluationEvaluate tools with weighted scoring and TCO analysis for strategic adoption.
- Identifies hidden costs and compares vendor stability against business needs.
- Integrates web_search for market research and shell_execute for testing.
- Calculates ROI scenarios using quantitative metrics and qualitative feedback.
- Delivers structured recommendations with clear success metrics and timelines.
SKILL.md
.github/skills/tool-evaluationView on GitHub ↗
--- name: tool-evaluation description: Evaluate, test, and recommend tools and platforms with weighted scoring, TCO analysis, and strategic adoption planning. Adapted from msitarzewski/agency-agents. --- ## Triggers - tool evaluation - software comparison - platform assessment - tool recommendation - vendor evaluation - technology assessment - tool selection - software review - buy vs build - tool adoption - platform comparison - TCO analysis - tool ROI - vendor selection - technology stack ## Instructions ### Requirements Gathering and Tool Discovery - Conduct stakeholder interviews to understand requirements and pain points - Research market landscape and identify potential tool candidates using `web_search` - Define evaluation criteria with weighted importance based on business priorities - Establish success metrics and evaluation timeline ### Comprehensive Tool Testing - Set up structured testing environment with realistic data and scenarios - Test functionality, usability, performance, security, and integration capabilities - Conduct user acceptance testing with representative user groups - Document findings with quantitative metrics and qualitative feedback using `shell_execute` ### Financial and Risk Analysis - Calculate total cost of ownership (TCO) including hidden costs and scaling fees - Assess vendor stability and strategic alignment - Evaluate implementation risk and change management requirements - Analyze ROI scenarios with different adoption rates and usage patterns - Use `knowledge_write` to store evaluation frameworks and past results ### Implementation Planning and Vendor Selection - Create detailed implementation roadmap with phases and milestones - Negotiate contract terms and service level agreements - Develop training and change management strategy - Establish success metrics and monitoring systems ### Evaluation Standards - Always test tools with real-world scenarios and actual user data - Use quantitative metrics and statistical analysis for comparisons - Validate vendor claims through independent testing and user references - Document evaluation methodology for reproducible decisions - Consider long-term strategic impact beyond immediate feature requirements - Factor in training, migration, and change management costs ### Weighted Evaluation Criteria - Functionality: 25% -- Core feature completeness - Usability: 20% -- User experience and ease of use - Performance: 15% -- Speed, reliability, scalability - Security: 15% -- Data protection and compliance - Integration: 10% -- API quality and system compatibility - Support: 8% -- Vendor support quality and documentation - Cost: 7% -- Total cost of ownership and value ## Deliverables ### Tool Evaluation Report Template ```markdown # [Tool Category] Evaluation and Recommendation Report ## Executive Summary **Recommended Solution**: [Top-ranked tool with key differentiators] **Investment Required**: [Total cost with ROI timeline and break-even analysis] **Implementation Timeline**: [Phases with milestones and resource requirements] **Business Impact**: [Quantified productivity gains and efficiency improvements] ## Evaluation Results **Tool Comparison Matrix**: [Weighted scoring across all criteria] **Category Leaders**: [Best-in-class tools for specific capabilities] **Performance Benchmarks**: [Quantitative performance testing results] **User Experience Ratings**: [Usability testing results across user roles] ## Financial Analysis **Total Cost of Ownership**: [3-year TCO breakdown with sensitivity analysis] **ROI Calculation**: [Projected returns with different adoption scenarios] **Cost Comparison**: [Per-user costs and scaling implications] **Budget Impact**: [Annual budget requirements and payment options] ## Risk Assessment **Implementation Risks**: [Technical, organizational, and vendor risks] **Security Evaluation**: [Compliance, data protection, vulnerability assessment] **Vendor Assessment**: [Stability, roadmap alignment, partnership potential] **Mitigation Strategies**: [Risk reduction and contingency planning] ## Implementation Strategy **Rollout Plan**: [Phased implementation with pilot and full deployment] **Change Management**: [Training strategy, communication plan, adoption support] **Integration Requirements**: [Technical integration and data migration planning] **Success Metrics**: [KPIs for measuring implementation success and ROI] --- **Confidence Level**: [High/Medium/Low with supporting methodology] **Next Review**: [Scheduled re-evaluation timeline] ``` ## Success Metrics - 90% of tool recommendations meet or exceed expected performance after implementation - 85% successful adoption rate for recommended tools within 6 months - 20% average reduction in tool costs through optimization and negotiation - 25% average ROI achievement for recommended tool investments - 4.5/5 stakeholder satisfaction rating for evaluation process and outcomes ## Verify - The intended other agent / tool / channel actually received the message; an ack, message ID, or response payload is captured - Identity, scopes, and permissions used by the call were the minimum required; over-permissioned tokens are called out - Failure handling was exercised: at least one retry/timeout/permission-denied path is shown to behave as designed - Hand-off context passed to the next actor is complete enough that the receiver could act without a follow-up question - Any state mutated (config, memory, queue, file) is listed with before/after values, not just 'updated' - Sensitive material (keys, tokens, PII) was redacted from logs/transcripts shared in the verification evidence
More from elophanto/EloPhanto
- 12-principles-of-animationAudit animation code against Disney's 12 principles adapted for web. Use when reviewing motion, implementing animations, or checking animation quality. Outputs file:line findings.
- accessibility-auditingAudit interfaces against WCAG 2.2 standards, test with assistive technologies, and ensure inclusive design beyond what automated tools catch. Adapted from msitarzewski/agency-agents.
- agency-phase-0-discoveryIntelligence and discovery phase — validate opportunity before committing resources. Adapted from msitarzewski/agency-agents.
- agency-phase-1-strategyStrategy and architecture phase — define what to build, how to structure it, and what success looks like. Adapted from msitarzewski/agency-agents.
- agency-phase-2-foundationFoundation and scaffolding phase — build technical and operational foundation before feature development. Adapted from msitarzewski/agency-agents.
- agency-phase-3-buildBuild and iterate phase — implement all features through continuous Dev-QA loops with orchestrated multi-agent sprints. Adapted from msitarzewski/agency-agents.
- agency-phase-4-hardeningQuality and hardening phase — the final quality gauntlet proving production readiness with evidence. Adapted from msitarzewski/agency-agents.
- agency-phase-5-launchLaunch and growth phase — coordinate go-to-market execution across all channels for maximum impact. Adapted from msitarzewski/agency-agents.
- agency-phase-6-operateOperate and evolve phase — sustained operations with continuous improvement for live products. Adapted from msitarzewski/agency-agents.
- agency-strategyNEXUS multi-agent orchestration strategy — the complete operational playbook for coordinating specialized AI agents across project phases. Adapted from msitarzewski/agency-agents.