Executive Snapshot
- 205 candidates scanned
- 70 GitHub repos/issues
- 48 HN/dev web items
- 40 paper signals
- 72% confidence (partial social)
Executive Technical Signal
- Harness/eval has become the #1 bottleneck → 40 papers plus benchmark/product refs for SWE-bench/Terminal-Bench → NEXA needs a replay set and an oracle metric before any agent rollout.
- Repo momentum is tilting toward runtime/CLI agents → 70 GitHub candidates, with stars/issues recorded from the API → prioritize PoCs of OpenCode/Codex/Claude Code in a sandbox.
- The KOL feed lacks metrics because X API/public access is blocked → 13 KOL URLs seeded, engagement=N/A → use as a watchlist, not as quantitative input to decisions.
- YouTube has 25 video candidates, but view/comment counts are blocked for public search → use only as an adoption radar, not for ROI.
- Facebook public = 0 usable → no effect on the technical thesis; lowers social-completeness confidence to 72%.
KPI Dashboard
| Source | Count |
|---|---|
| X | 13 |
| YouTube | 25 |
| | 0 |
| HN | 48 |
| GitHub | 70 |
| arXiv | 40 |
| Product | 8 |
| | 1 |
KOL/OG Feed Watch
PARTIAL. X KOL URLs seeded: swyx, karpathy, simonw, Daniel Gross, Paul Graham, Replit/Amjad, Latent Space. Engagement/timestamp=N/A because no API access is available. YouTube search candidates=25. HN fresh items=48.
Trend Radar + CTO Evaluation Matrix
| Signal | Evidence | Counter-signal | Fabbi implication | Decision | Next validation |
|---|---|---|---|---|---|
| Agent harness/reliability | 40 paper + benchmark/Product refs | Benchmark ≠ production codebase | SYNCA quality gate; NEXA eval loop | trial 80% | 20 task replay, pass@1, cost/task |
| CLI/IDE agent runtime | 70 GitHub candidates | OSS churn; security gaps | NEXA sandbox executor; AIOS policy | trial 75% | 2-week PoC across 3 repos |
| Context engineering | HN 48 + product docs | Context bloat/cost | FARE codebase map + retrieval | adopt 78% | Measure retrieval hit@5, token/task |
| Enterprise governance/HITL | Product refs 8 | Metrics sparse | SYNCA risk approvals; DOMUS workflow | watch 68% | Policy checklist + audit log pilot |
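The "Next validation" column names pass@1 and cost/task over a 20-task replay. A minimal sketch of how those two numbers could be computed is shown here, assuming one attempt per task. The `ReplayResult` record, its field names, and the sample values are hypothetical and do not come from any existing NEXA harness.

```python
from dataclasses import dataclass


@dataclass
class ReplayResult:
    """One replayed task (hypothetical record shape)."""
    task_id: str
    passed_first_try: bool  # oracle check passed on the first attempt
    cost_usd: float         # total spend for this task


def pass_at_1(results: list[ReplayResult]) -> float:
    """Fraction of tasks solved on the first attempt."""
    if not results:
        raise ValueError("no replay results")
    return sum(r.passed_first_try for r in results) / len(results)


def cost_per_task(results: list[ReplayResult]) -> float:
    """Mean spend per replayed task."""
    if not results:
        raise ValueError("no replay results")
    return sum(r.cost_usd for r in results) / len(results)


# Illustrative data only: 20 tasks, the first 13 pass on the first try.
sample = [
    ReplayResult(f"task-{i:02d}", passed_first_try=(i < 13), cost_usd=0.5)
    for i in range(20)
]
print(f"pass@1 = {pass_at_1(sample):.2f}, cost/task = ${cost_per_task(sample):.2f}")
```

Keeping the oracle verdict and the cost in the same record makes it easy to compare agents on both axes from a single replay run.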
CTO Recommendations
- NEXA eval harness sprint: ROI/time saving 18-25%, risk 2/5, owner: AI Platform Lead, TTV 2 weeks, validate: 20 replay tasks + cost/task.
- FARE context index baseline: ROI 12-20%, risk 2/5, owner: Search/Backend Lead, TTV 10 days, validate: hit@5 + accepted patch rate.
- SYNCA agent governance gate: risk reduction 30%, risk 3/5, owner: QA/Security Lead, TTV 3 weeks, validate: audit log + blocked unsafe actions.
- Japan/VN pilot package: sales cycle saving 10-15%, risk 3/5, owner: CTO+Presales, TTV 4 weeks, validate: 2 client demos + quantified dev-hour delta.
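The FARE recommendation validates against hit@5. As a rough sketch, hit@5 can be read as "at least one relevant file appears in the top 5 retrieved results", averaged over queries. The function names and the file paths below are illustrative assumptions, not part of any FARE API.

```python
def hit_at_k(retrieved: list[str], relevant: set[str], k: int = 5) -> bool:
    """True if any of the top-k retrieved items is in the relevant set."""
    return any(item in relevant for item in retrieved[:k])


def mean_hit_at_k(queries: list[tuple[list[str], set[str]]], k: int = 5) -> float:
    """Share of queries with at least one relevant item in the top k."""
    if not queries:
        raise ValueError("no queries")
    return sum(hit_at_k(ranked, rel, k) for ranked, rel in queries) / len(queries)


# Illustrative queries: (ranked retrieval output, ground-truth relevant files).
ranked = ["a.py", "b.py", "c.py", "d.py", "e.py", "f.py"]
queries = [
    (ranked, {"c.py"}),      # relevant file at rank 3: hit
    (ranked, {"f.py"}),      # relevant file at rank 6: miss
    (["x.py"], {"x.py"}),    # short result list, still a hit
    ([], {"y.py"}),          # nothing retrieved: miss
]
print(f"hit@5 = {mean_hit_at_k(queries):.2f}")
```

The ground-truth sets would have to come from labeled tasks, for example the files touched by an accepted patch.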
Impact Coverage
| Domain | Now 0-2w | Next 1-2m | Later 3-6m |
|---|---|---|---|
| FARE | adopt context metrics | repo map | enterprise KB |
| NEXA | trial harness | CLI executor | multi-agent orchestration |
| SYNCA | quality gate | risk scoring | governance console |
| DOMUS | monitor | workflow HITL | agent ops |
| Japan/VN/Global | watch adoption proof | pilot offer | package delivery model |
Source Appendix
| # | Platform | Link | Author | Time | Engagement | Topic |
|---|---|---|---|---|---|---|
| 1 | HN | Show HN: Komi-learn – continuous memory and self-improvement for coding agents | rainxchzed | 2026-05-31T05:11:40Z | 13 pts/2 cmt | coding agent |
| 2 | HN | OMP – pi agent with batteries included and a coding agent with the IDE wired in | himata4113 | 2026-05-31T04:57:59Z | 4 pts/0 cmt | coding agent |
| 3 | HN | Ask HN: What are your worst war stories bringing agentic applications into prod | yaoke259 | 2026-05-31T02:07:38Z | 6 pts/0 cmt | coding agent |
| 4 | HN | Show HN: Thaw – Git branch for a running LLM (fork agents, skip prefill) | nilsmatteson | 2026-05-30T22:07:26Z | 3 pts/0 cmt | coding agent |
| 5 | HN | Zerostack v1.3.4 released – Lightweight Unix-inspired coding agent | gidellav | 2026-05-30T20:48:53Z | 12 pts/3 cmt | coding agent |
| 6 | HN | Zerostack v1.3.4 released – Lightweight Unix-like coding agent | gidellav | 2026-05-30T20:19:19Z | 6 pts/0 cmt | coding agent |
| 7 | HN | 6 Months of "Agentic" Coding | ashutoshbsathe | 2026-05-30T16:05:46Z | 3 pts/0 cmt | coding agent |
| 8 | HN | The Coding Harness Behind GitHub Copilot in VS Code | ankitg12 | 2026-05-30T15:55:04Z | 2 pts/0 cmt | coding agent |
| 9 | HN | Ask HN: Did anyone noticed – Claude vs. Claude generated code act different? | kocialnews | 2026-05-31T06:50:12Z | 2 pts/1 cmt | Claude Code |
| 10 | HN | A standard for building production AI agents (+ installable Claude Code skills) | AlexDuch | 2026-05-31T05:00:23Z | 2 pts/0 cmt | Claude Code |
| 11 | HN | Show HN: Lite-Harness – Self-Hosted Cursor Agents (Use Claude Code/OpenCode) | detente18 | 2026-05-30T23:51:21Z | 6 pts/0 cmt | Claude Code |
| 12 | HN | Arch-Decision – A multi-agent architecture tool for Claude Code | jsingh2525 | 2026-05-30T22:45:31Z | 3 pts/0 cmt | Claude Code |
| 13 | HN | Show HN: Use Kimi and OpenAI Subscriptions in Claude Code | rane | 2026-05-30T19:23:51Z | 3 pts/0 cmt | Claude Code |
| 14 | HN | Claude Code vs. Codex: FRA challenge 75746d-2025 | JoelJacobson | 2026-05-30T18:48:09Z | 4 pts/0 cmt | Claude Code |
| 15 | HN | I spent a year building agent memory on knowledge graphs. Here are my 5 mistakes | pauliusztin | 2026-05-30T16:04:30Z | 3 pts/0 cmt | Claude Code |
| 16 | HN | Collection of Claude Code Skills | ankitg12 | 2026-05-30T14:52:06Z | 3 pts/0 cmt | Claude Code |
| 17 | HN | Show HN: Use Kimi and OpenAI Subscriptions in Claude Code | rane | 2026-05-30T19:23:51Z | 3 pts/0 cmt | OpenAI Codex |
| 18 | HN | Show HN: Free open source coding models in Slack | ramonga | 2026-05-28T16:11:13Z | 3 pts/0 cmt | OpenAI Codex |
| 19 | HN | First thing you see when Googling "OpenAI Codex app" is a fake malware website | vashchylau | 2026-05-28T13:49:02Z | 3 pts/0 cmt | OpenAI Codex |
| 20 | HN | Building self-improving tax agents with Codex | dnw | 2026-05-27T15:48:40Z | 2 pts/0 cmt | OpenAI Codex |
| 21 | HN | Bill Gates AI on AI (one month later) | vbutsomesayw | 2026-05-27T04:01:44Z | 3 pts/0 cmt | OpenAI Codex |
| 22 | HN | The Codex Showcase | wordsaboutcode | 2026-05-27T03:00:38Z | 4 pts/0 cmt | OpenAI Codex |
| 23 | HN | Building a safe, effective sandbox to enable Codex on Windows | gmays | 2026-05-26T21:37:19Z | 1 pts/0 cmt | OpenAI Codex |
| 24 | HN | Show HN: PrismCat – Local transparent proxy and debugging console for LLM APIs | etgpao | 2026-05-26T13:11:26Z | 2 pts/2 cmt | OpenAI Codex |
| 25 | HN | We Benchmarked Our Open Source Memory Tool Against a Microsoft Research Paper | vektormemory | 2026-05-30T22:03:56Z | 2 pts/0 cmt | SWE-bench |
| 26 | HN | Mini-SWE-agent scores up to 74% on SWE-bench in 100 lines of Python code | fittingopposite | 2026-05-28T05:05:59Z | 2 pts/0 cmt | SWE-bench |
| 27 | HN | Show HN: 97% on SWE-bench Verified with subscription-token agents | kimjune01 | 2026-05-24T18:03:28Z | 2 pts/0 cmt | SWE-bench |
| 28 | HN | Bito's AI Architect Boosts Claude Opus's task success rate by 35% | Sushrutkm | 2026-05-19T10:02:03Z | 2 pts/0 cmt | SWE-bench |
| 29 | HN | Show HN: Statewright – Visual state machines that make AI agents reliable | azurewraith | 2026-05-12T14:24:55Z | 126 pts/59 cmt | SWE-bench |
| 30 | HN | Show HN: New Benchmark from SWE-bench team is 0% solved | lieret | 2026-05-05T15:10:41Z | 24 pts/3 cmt | SWE-bench |
| 31 | HN | talkie-coder: From 1930 to SWE-bench | Philpax | 2026-05-02T21:35:54Z | 2 pts/0 cmt | SWE-bench |
| 32 | HN | Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error | jryio | 2026-04-29T19:16:48Z | 2 pts/0 cmt | SWE-bench |
| 33 | HN | Show HN: Lite-Harness – Self-Hosted Cursor Agents (Use Claude Code/OpenCode) | detente18 | 2026-05-30T23:51:21Z | 6 pts/0 cmt | Cursor agent |
| 34 | HN | Show HN: OpenHive – AI agents share solutions so other agents dont re-solve them | ananandreas | 2026-05-29T14:35:42Z | 5 pts/0 cmt | Cursor agent |
| 35 | HN | Show HN: TheFoundry – Easy bootstrapping framework for MultiAgent Systems | kiBytes | 2026-05-29T13:18:07Z | 2 pts/0 cmt | Cursor agent |
| 36 | HN | Show HN: AI Skill to port PostgreSQL extensions to MySQL | deesix | 2026-05-28T15:18:45Z | 4 pts/0 cmt | Cursor agent |
| 37 | HN | Show HN: Multiplayer, a debugging agent to run locally next to your coding agent | tomjohnson3 | 2026-05-28T14:16:13Z | 7 pts/1 cmt | Cursor agent |
| 38 | HN | Windows computer-use: synthetic cursors for background agents | frabonacci | 2026-05-27T18:48:20Z | 3 pts/0 cmt | Cursor agent |
| 39 | HN | Show HN: Turnstile – a Windows browser picker that suggests routing rules | perryizgr8 | 2026-05-27T16:06:04Z | 1 pts/0 cmt | Cursor agent |
| 40 | HN | Show HN: GridPath – Faster and Better Agent for Spreadsheets (Tauri, Rust) | pixelmash13 | 2026-05-27T15:14:11Z | 1 pts/0 cmt | Cursor agent |
| 41 | HN | Show HN: A Claude Code skill that scopes problems like Peter Naur | spinchange | 2026-05-30T02:04:12Z | 2 pts/0 cmt | agentic programming |
| 42 | HN | Bill Gates AI on AI (one month later) | vbutsomesayw | 2026-05-27T04:01:44Z | 3 pts/0 cmt | agentic programming |
| 43 | HN | Show HN: Simple Sprite Sheet Generation | armcat | 2026-05-24T19:37:43Z | 3 pts/0 cmt | agentic programming |
| 44 | HN | Show HN: My first app, artisanally vibe-coded in 4 months | jeroen_stulen | 2026-05-24T10:07:13Z | 3 pts/5 cmt | agentic programming |
| 45 | HN | Zero – Programming Language for Agents | xendo | 2026-05-23T11:13:35Z | 3 pts/0 cmt | agentic programming |
| 46 | HN | Show HN: opub, donated compute for open-source | goodroot | 2026-05-21T14:59:15Z | 2 pts/0 cmt | agentic programming |
| 47 | HN | Zero: The Programming Language for Agents | afshinmeh | 2026-05-19T20:19:46Z | 3 pts/0 cmt | agentic programming |
| 48 | HN | Show HN: Korveo – a local firewall for AI agents | amitbidlan | 2026-05-19T17:40:39Z | 1 pts/3 cmt | agentic programming |
| 49 | GitHub | ahmadulhoq/agentskel | ahmadulhoq | 2026-05-31T08:38:23Z | 13 stars/2 forks/0 issues | coding-agent |
| 50 | GitHub | kubev2v/mtv-skills | kubev2v | 2026-05-31T08:59:11Z | 0 stars/0 forks/0 issues | coding-agent |
| 51 | GitHub | vu1n/pillbox | vu1n | 2026-05-31T08:59:02Z | 0 stars/0 forks/0 issues | coding-agent |
| 52 | GitHub | crowl/ronin | crowl | 2026-05-31T08:58:59Z | 0 stars/0 forks/0 issues | coding-agent |
| 53 | GitHub | Musiitwa-Joel/letta-code-sdk | Musiitwa-Joel | 2026-05-31T08:58:54Z | 0 stars/0 forks/0 issues | coding-agent |
| 54 | GitHub | nova-agents-ai/nova-code | nova-agents-ai | 2026-05-31T08:58:52Z | 0 stars/0 forks/1 issues | coding-agent |
| 55 | GitHub | stablyai/orca | stablyai | 2026-05-31T08:58:52Z | 3782 stars/251 forks/276 issues | coding-agent |
| 56 | GitHub | Crafter-feng/hermes-unified | Crafter-feng | 2026-05-31T08:58:34Z | 1 stars/0 forks/0 issues | coding-agent |
| 57 | GitHub | HaitaoWuTJU/cverl | HaitaoWuTJU | 2026-05-31T08:58:34Z | 3 stars/0 forks/0 issues | coding-agent |
| 58 | GitHub | isonil/serow | isonil | 2026-05-31T08:58:33Z | 0 stars/0 forks/0 issues | coding-agent |
| 59 | GitHub | ahmadulhoq/agentskel | ahmadulhoq | 2026-05-31T08:59:22Z | 13 stars/2 forks/0 issues | ai-agent |
| 60 | GitHub | dwana1/golang-skills | dwana1 | 2026-05-31T08:59:20Z | 0 stars/0 forks/0 issues | ai-agent |
Data Quality / Scan Health
Total=205; cited/summarized=60. PASS: volume >=100. PARTIAL: social coverage (Reddit JSON/search blocked, 0 items; Facebook public, 0 usable; X engagement N/A, no API; YouTube metrics N/A, public search only). GitHub/HN/arXiv/product usable. Confidence impact: -18pp.
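The volume and social checks can be expressed as a small gate. The sketch below copies the counts from the KPI Dashboard and the 100-item threshold from the line above. The function name, the result keys, and the placeholder names for the two dashboard rows that carry no source label are assumptions.

```python
# Counts copied from the KPI Dashboard. Two rows there have no source label,
# so "unlabeled_1" and "unlabeled_2" are placeholder names, not real sources.
counts = {
    "X": 13,
    "YouTube": 25,
    "unlabeled_1": 0,
    "HN": 48,
    "GitHub": 70,
    "arXiv": 40,
    "Product": 8,
    "unlabeled_2": 1,
}

# Sources reported as blocked or missing metrics in this scan.
degraded = ["Reddit", "Facebook", "X", "YouTube"]


def scan_health(counts: dict[str, int], degraded: list[str], min_volume: int = 100) -> dict:
    """Summarize scan volume and social completeness (hypothetical gate)."""
    total = sum(counts.values())
    return {
        "total": total,
        "volume": "PASS" if total >= min_volume else "FAIL",
        "social": "PARTIAL" if degraded else "PASS",
    }


print(scan_health(counts, degraded))
```

Summing the dashboard rows reproduces the reported total of 205, which is a quick consistency check on the scan output.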