{"id":13513,"date":"2025-12-19T03:16:27","date_gmt":"2025-12-18T19:16:27","guid":{"rendered":"https:\/\/ihower.tw\/blog\/?p=13513"},"modified":"2025-12-19T18:48:26","modified_gmt":"2025-12-19T10:48:26","slug":"agent-design-is-still-hard-2025","status":"publish","type":"post","link":"https:\/\/ihower.tw\/blog\/13513-agent-design-is-still-hard-2025","title":{"rendered":"AI Agent \u7522\u54c1\u958b\u767c\u4ecd\u7136\u4e0d\u7c21\u55ae"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"806\" data-attachment-id=\"13533\" data-permalink=\"https:\/\/ihower.tw\/blog\/13513-agent-design-is-still-hard-2025\/image-35\" data-orig-file=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3.png\" data-orig-size=\"1600,1260\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-300x236.png\" data-large-file=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-1024x806.png\" src=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-1024x806.png\" alt=\"\" class=\"wp-image-13533\" style=\"aspect-ratio:1.2705103359173127;width:767px;height:auto\" srcset=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-1024x806.png 1024w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-300x236.png 300w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-768x605.png 768w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-1536x1210.png 1536w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3-1568x1235.png 1568w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-3.png 1600w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>\u5728\u8b1b\u5b8c WebConf \u4e4b\u5f8c\uff0c\u6211\u6709\u7a2e\u83ab\u540d\u7684\u4e0d\u5354\u8abf\u611f: \u4e00\u65b9\u9762 Vibe Coding \u8b93\u5927\u5bb6\u5beb\u7a0b\u5f0f\u8b8a\u7c21\u55ae\u4e86\uff0c\u4eba\u4eba\u90fd\u53ef\u4ee5\u505a App \u4e86\uff0c\u4e5f\u5f88\u591a\u4eba\u8b1b\u786c\u6280\u80fd\u4e0d\u91cd\u8981\u4e86\u3002\u4f46\u53e6\u4e00\u65b9\u9762\uff0c\u6211\u89ba\u5f97\u958b\u767c AI Agent \u7522\u54c1\u4ecd\u662f\u975e\u5e38\u6709\u6280\u8853\u6311\u6230\u6027\u7684\uff0c\u9700\u8981\u7684\u77e5\u8b58\u6280\u80fd\u6df1\u5ea6\u5ee3\u5ea6\u4e00\u9ede\u90fd\u4e0d\u5c11\u3002<\/p>\n\n\n\n<p>\u6700\u8fd1\u4e5f\u770b\u5230\u4e86\u5e7e\u7bc7\u95dc\u65bc AI Agent \u958b\u767c\u7684\u6587\u7ae0\uff0c\u767c\u73fe\u570b\u5916\u6280\u8853\u793e\u7fa4\u5728 2025 Q4 \u4e5f\u6709\u985e\u4f3c\u7684\u9ad4\u609f: Agent \u7522\u54c1\u958b\u767c\u8a2d\u8a08\u9084\u662f\u5f88\u96e3\u3002<\/p>\n\n\n\n<p>\u4e0d\u662f\u300c\u5beb\u7a0b\u5f0f\u5f88\u96e3\u300d\u90a3\u7a2e\u96e3\uff0c\u800c\u662f\u300c95% \u7684 AI Agent \u7522\u54c1\uff0c\u9032\u5230\u6b63\u5f0f\u74b0\u5883\u6703\u5931\u6557\u300d\u9019\u7a2e\u96e3\u3002\u554f\u984c\u4e0d\u5728\u6a21\u578b\u4e0d\u5920\u8070\u660e\uff0c\u800c\u5728\u65bc\u5468\u908a\u7684\u5de5\u7a0b\u67b6\u69cb: context \u7ba1\u7406\u3001memoy \u8a2d\u8a08\u3001\u932f\u8aa4\u8655\u7406\u3001agent prompt \u6700\u4f73\u5316\u3001\u8a9e\u610f\u6aa2\u7d22\u3001\u8a55\u4f30\u56de\u994b\u6a5f\u5236\u7b49\u7b49\uff0c\u5f88\u591a\u90fd\u662f\u5168\u65b0\u9818\u57df\uff0c\u4e14\u6230\u4e14\u8d70\u7684\u60c5\u6cc1\u3002\u6a21\u578b\u53ea\u80fd\u7528\u5e7e\u500b\u6708\u5c31\u8981\u5347\u7d1a\u66f4\u63db\uff0c\u5e7e\u500b\u6708\u524d\u7684 best practice \u4e5f\u53ef\u80fd\u6703\u88ab\u63a8\u7ffb\u91cd\u65b0\u601d\u8003\u3002<\/p>\n\n\n\n<p>\u7e3d\u4e4b\uff0c\u4ee5\u4e0b\u6211\u6574\u7406\u5e74\u5e95\u56db\u7bc7\u6211\u89ba\u5f97\u95dc\u65bc Agent \u958b\u767c\u6c1b\u570d\u7684\u4e0d\u932f\u6587\u7ae0:<\/p>\n\n\n\n<!--more-->\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>\ud83d\udd39 \u5be6\u6230\u8e29\u5751: Flask \u4f5c\u8005\u7684 Agent \u958b\u767c\u5fc3\u5f97<\/strong><\/p>\n\n\n\n<p><a href=\"https:\/\/lucumr.pocoo.org\/2025\/11\/21\/agents-are-hard\">Agent Design Is Still Hard<\/a><\/p>\n\n\n\n<p>Armin Ronacher (Flask \u6846\u67b6\u4f5c\u8005) \u5206\u4eab\u4e86\u4ed6\u958b\u767c Agent \u7684\u7d93\u9a57\uff0c\u5e7e\u500b\u5be6\u6230\u89c0\u5bdf:<\/p>\n\n\n\n<p>\u95dc\u65bc SDK \u9078\u64c7: \u4ed6\u5011\u539f\u672c\u7528 Vercel AI SDK\uff0c\u73fe\u5728\u4e0d\u6703\u518d\u9019\u6a23\u9078\u4e86\u3002\u5404\u6a21\u578b\u5dee\u7570\u592a\u5927\uff0c\u7528 Anthropic \u6216 OpenAI \u539f\u751f SDK \u53cd\u800c\u66f4\u597d\u63a7\u5236\u3002\u9ad8\u968e\u62bd\u8c61\u807d\u8d77\u4f86\u5f88\u7f8e\u597d\uff0c\u4f46\u6700\u7d42\u9084\u662f\u5f97\u81ea\u5df1\u5efa\u7acb agent \u7684\u62bd\u8c61\u5c64\u3002<\/p>\n\n\n\n<p>\u95dc\u65bc\u5feb\u53d6\u7ba1\u7406(Prompt Caching): Anthropic \u8981\u6c42\u986f\u5f0f\u7ba1\u7406\u5feb\u53d6\u9ede\uff0c\u4e00\u958b\u59cb\u89ba\u5f97\u5f88\u8822\uff0c\u70ba\u4ec0\u9ebc\u5e73\u53f0\u4e0d\u81ea\u52d5\u8655\u7406? \u4f46\u5f8c\u4f86\u5b8c\u5168\u6539\u89c0\uff0c\u986f\u5f0f\u7ba1\u7406\u8b93\u6210\u672c\u548c\u5feb\u53d6\u5229\u7528\u7387\u66f4\u53ef\u9810\u6e2c\uff0c\u9084\u80fd\u505a\u5230 context \u7de8\u8f2f\u548c\u5c0d\u8a71\u5206\u652f\u9019\u4e9b\u9032\u968e\u64cd\u4f5c\u3002<\/p>\n\n\n\n<p>\u95dc\u65bc Reinforcement (\u589e\u5f37\u56de\u994b): \u6bcf\u6b21 Agent \u57f7\u884c\u5b8c\u5de5\u5177\u5f8c\uff0c\u4e0d\u53ea\u662f\u56de\u50b3\u8cc7\u6599\uff0c\u9084\u53ef\u4ee5\u585e\u66f4\u591a\u8cc7\u8a0a\u9032\u53bb: \u63d0\u9192\u6574\u9ad4\u76ee\u6a19\u3001\u4efb\u52d9\u72c0\u614b\u3001\u5931\u6557\u6642\u7d66\u63d0\u793a\u3002\u9019\u500b\u300c\u589e\u5f37\u300d\u6a5f\u5236\u6bd4\u60f3\u50cf\u4e2d\u66f4\u91cd\u8981\u3002<\/p>\n\n\n\n<p>\u95dc\u65bc\u932f\u8aa4\u8655\u7406: \u5982\u679c\u9810\u671f\u6703\u6709\u5f88\u591a\u5931\u6557\uff0c\u53ef\u4ee5\u7528\u5b50 agent \u8dd1\u5230\u6210\u529f\u70ba\u6b62\uff0c\u53ea\u56de\u5831\u6210\u529f\u7d50\u679c\u3002\u4f46\u8b93 agent \u77e5\u9053\u300c\u4ec0\u9ebc\u65b9\u6cd5\u6c92\u7528\u300d\u4e5f\u5f88\u91cd\u8981\uff0c\u80fd\u5e6b\u52a9\u4e0b\u4e00\u6b65\u907f\u958b\u540c\u6a23\u7684\u5751\u3002<\/p>\n\n\n\n<p>\u95dc\u65bc\u5171\u4eab\u72c0\u614b: \u591a\u6578 agent \u9700\u8981\u4e00\u500b\u5171\u540c\u5b58\u653e\u8cc7\u6599\u7684\u5730\u65b9\u3002\u4ed6\u5011\u9078\u64c7\u7528\u865b\u64ec\u6a94\u6848\u7cfb\u7d71\uff0c\u9019\u6a23\u4e0d\u540c\u5de5\u5177\u548c\u5b50 agent \u90fd\u80fd\u5b58\u53d6\u540c\u4e00\u4efd\u8cc7\u6599\uff0c\u907f\u514d\u8cc7\u6599\u5b64\u5cf6\u3002<\/p>\n\n\n\n<p>\u95dc\u65bc\u6e2c\u8a66: Testing \u548c Evals \u662f\u6700\u96e3\u7684\u90e8\u5206\uff0c\u76ee\u524d\u9084\u6c92\u627e\u5230\u6eff\u610f\u7684\u65b9\u6848\u3002Agent \u7684\u7279\u6027\u8b93\u50b3\u7d71\u6e2c\u8a66\u65b9\u6cd5\u90fd\u4e0d\u592a\u9069\u7528\u3002<\/p>\n\n\n\n<p>\u4ed6\u6700\u5f8c\u88dc\u4e86\u4e00\u6bb5\u6211\u5f88\u559c\u6b61: \u300c\u5982\u679c\u4f60\u6839\u672c\u4e0d\u9700\u8981 MCP \u5462?\u300d\u5f88\u591a MCP server \u904e\u5ea6\u8a2d\u8a08\uff0c\u585e\u4e86\u4e00\u5806\u5de5\u5177\u5403\u6389\u5927\u91cf context\uff0c\u5176\u5be6\u7528\u7c21\u55ae\u7684 CLI \u5de5\u5177\u900f\u904e Bash \u57f7\u884c\u5c31\u597d\u3002<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>\ud83d\udd39 \u70ba\u4ec0\u9ebc 95% \u7684 Agent \u5728 Production \u5931\u6557?<\/strong><\/p>\n\n\n\n<p><a href=\"https:\/\/www.motivenotes.ai\/p\/what-makes-5-of-ai-agents-actually\">What Makes 5% of AI Agents Actually Work in Production?<\/a><\/p>\n\n\n\n<p>\u9019\u7bc7\u4f86\u81ea\u4e00\u5834\u820a\u91d1\u5c71\u6d3b\u52d5\u7684\u5ea7\u8ac7\u7b46\u8a18\uff0c\u6709\u53e5\u8a71\u5f88\u4e2d\u80af: \u300c\u5927\u591a\u6578\u5275\u8fa6\u4eba\u4ee5\u70ba\u81ea\u5df1\u5728\u505a AI \u7522\u54c1\uff0c\u5176\u5be6\u662f\u5728\u505a context selection \u4e0a\u4e0b\u6587\u9078\u64c7\u7cfb\u7d71\u3002\u300d<\/p>\n\n\n\n<p>Context Engineering \u4e0d\u7b49\u65bc Prompt \u6280\u5de7: RAG \u505a\u5f97\u597d\u5176\u5be6\u5c31\u5920\u7528\uff0c\u4e0d\u592a\u9700\u8981 fine-tuning\u3002\u4f46\u591a\u6578 RAG \u7cfb\u7d71\u592a\u5929\u771f &#8211; \u7d22\u5f15\u592a\u591a\u6703\u8b93\u6a21\u578b\u6df7\u4e82\uff0c\u7d22\u5f15\u592a\u5c11\u53c8\u7f3a\u4e4f\u8a0a\u865f\u3002\u9032\u968e\u7684 context \u5de5\u7a0b\u66f4\u50cf\u662f\u300c\u7d66 LLM \u505a\u7279\u5fb5\u5de5\u7a0b\u300d: \u9078\u64c7\u6027\u88c1\u526a\u3001\u9a57\u8b49\u3001\u53ef\u89c0\u6e2c\u6027\u90fd\u662f\u529f\u592b\u3002<\/p>\n\n\n\n<p>Text-to-SQL \u7684\u6b98\u9177\u73fe\u5be6: \u4e3b\u6301\u4eba\u554f\u300c\u6709\u591a\u5c11\u4eba\u628a text-to-SQL \u505a\u5230\u6b63\u5f0f\u74b0\u5883?\u300d\u7d50\u679c\u6c92\u4eba\u8209\u624b\u3002\u4e0d\u662f\u9019\u554f\u984c\u592a\u5c0f\u773e\uff0c\u800c\u662f\u67e5\u8a62\u7406\u89e3\u771f\u7684\u8d85\u96e3 &#8211; \u81ea\u7136\u8a9e\u8a00\u6709\u6b67\u7fa9\uff0c\u5546\u696d\u8853\u8a9e\u662f\u9818\u57df\u5c08\u5c6c\u7684\uff0cLLM \u4e0d\u77e5\u9053\u4f60\u516c\u53f8\u5b9a\u7fa9\u7684\u300c\u71df\u6536\u300d\u6216\u300c\u6d3b\u8e8d\u7528\u6236\u300d\u662f\u4ec0\u9ebc\u610f\u601d\u3002\u6210\u529f\u7684\u5718\u968a\u6703\u5efa\u7acb\u5546\u696d\u8a5e\u5f59\u8868\u3001\u67e5\u8a62\u6a21\u677f\u3001\u9a57\u8b49\u5c64\u548c\u56de\u994b\u8ff4\u5708\u3002<\/p>\n\n\n\n<p>\u4fe1\u4efb\u554f\u984c\u662f\u4eba\u7684\u554f\u984c\uff0c\u4e0d\u662f\u6280\u8853\u554f\u984c: \u6709\u4f4d\u8b1b\u8005\u8aaa\u4ed6\u8001\u5a46\u4e0d\u8b93\u4ed6\u7528 Tesla \u81ea\u52d5\u99d5\u99db\uff0c\u4e0d\u662f\u56e0\u70ba\u5b83\u4e0d\u884c\uff0c\u800c\u662f\u5979\u4e0d\u4fe1\u4efb\u3002\u4f01\u696d AI \u4e5f\u4e00\u6a23\u3002\u90a3\u6210\u529f\u7684 5% agent \u6709\u4ec0\u9ebc\u5171\u540c\u9ede? \u90fd\u6709\u4eba\u6a5f\u5354\u4f5c\u8a2d\u8a08\uff0c\u8b93 AI \u7576\u52a9\u624b\u800c\u4e0d\u662f\u81ea\u4e3b\u6c7a\u7b56\u8005\uff0c\u4e26\u4e14\u5efa\u7acb\u56de\u994b\u8ff4\u5708\u8b93\u7cfb\u7d71\u5f9e\u4fee\u6b63\u4e2d\u5b78\u7fd2\u3002<\/p>\n\n\n\n<p>\u8a18\u61b6\u4e0d\u53ea\u662f\u5132\u5b58\uff0c\u662f\u67b6\u69cb\u6c7a\u7b56: \u5927\u5bb6\u90fd\u60f3\u300c\u52a0\u8a18\u61b6\u529f\u80fd\u300d\uff0c\u4f46\u8a18\u61b6\u662f\u8a2d\u8a08\u6c7a\u7b56\uff0c\u8981\u5340\u5206\u7528\u6236\u5c64\u7d1a\u3001\u5718\u968a\u5c64\u7d1a\u3001\u7d44\u7e54\u5c64\u7d1a\u3002\u800c\u4e14\u4ec0\u9ebc\u6642\u5019\u300c\u500b\u4eba\u5316\u300d\u6703\u8b8a\u6210\u300c\u4fb5\u72af\u96b1\u79c1\u300d? \u6709\u8b1b\u8005\u8aaa ChatGPT \u63a8\u85a6\u5bb6\u5ead\u96fb\u5f71\u6642\u76f4\u63a5\u53eb\u51fa\u4ed6\u5c0f\u5b69\u7684\u540d\u5b57\uff0c\u4ed6\u7684\u53cd\u61c9\u662f: \u300c\u5225\u78b0\u6211\u7684\u96b1\u79c1\u3002\u300d\u9019\u4e2d\u9593\u7684\u5e73\u8861\u5f88\u5fae\u5999\u3002<\/p>\n\n\n\n<p>\u591a\u6a21\u578b\u8abf\u5ea6: \u6b63\u5f0f\u74b0\u5883\u4e0d\u6703\u6240\u6709\u6771\u897f\u90fd\u4e1f\u7d66\u6700\u5f37\u6700\u8cb4\u6a21\u578b\u3002\u5718\u968a\u6703\u6839\u64da\u4efb\u52d9\u8907\u96dc\u5ea6\u3001\u5ef6\u9072\u8981\u6c42\u3001\u6210\u672c\u654f\u611f\u5ea6\u4f86\u505a\u6a21\u578b\u8def\u7531: \u7c21\u55ae\u554f\u984c\u7528\u5c0f\u5feb\u6a21\u578b\uff0c\u8907\u96dc\u63a8\u7406\u624d\u7528\u9802\u7d1a\u6a21\u578b\u3002\u800c\u4e14\u54ea\u500b\u67e5\u8a62\u9069\u5408\u54ea\u500b\u6a21\u578b\uff0c\u9019\u500b\u9078\u64c7\u672c\u8eab\u4e5f\u53ef\u4ee5\u96a8\u6642\u9593\u5b78\u7fd2\u512a\u5316\u3002<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>\ud83d\udd39 Agent \u80fd\u529b\u7684\u91d1\u5b57\u5854\u5c64\u7d1a<\/strong><\/p>\n\n\n\n<p><a href=\"https:\/\/surgehq.ai\/blog\/rl-envs-real-world\">RL Environments and the Hierarchy of Agentic Capabilities<\/a><\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"564\" data-attachment-id=\"13514\" data-permalink=\"https:\/\/ihower.tw\/blog\/13513-agent-design-is-still-hard-2025\/image-32\" data-orig-file=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image.png\" data-orig-size=\"1080,595\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-300x165.png\" data-large-file=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-1024x564.png\" src=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-1024x564.png\" alt=\"\" class=\"wp-image-13514\" style=\"aspect-ratio:1.815661061655127;width:816px;height:auto\" srcset=\"https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-1024x564.png 1024w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-300x165.png 300w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image-768x423.png 768w, https:\/\/ihower.tw\/blog\/wp-content\/uploads\/2025\/12\/image.png 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Surge AI \u628a 9 \u500b\u9802\u7d1a\u6a21\u578b\u4e1f\u9032\u6a21\u64ec\u8077\u5834\u74b0\u5883\uff0c\u7d66 150 \u500b\u5ba2\u670d\u4efb\u52d9\u3002\u7d50\u679c? \u5373\u4f7f\u662f GPT-5 \u548c Claude Sonnet 4.5 \u4e5f\u6709\u8d85\u904e 40% \u7684\u4efb\u52d9\u5931\u6557\u3002<\/p>\n\n\n\n<p>\u4ed6\u5011\u5f9e\u5931\u6557\u6a21\u5f0f\u4e2d\u6b78\u7d0d\u51fa\u300cAgent \u80fd\u529b\u5c64\u7d1a\u300d\u91d1\u5b57\u5854:<\/p>\n\n\n\n<p>\u7b2c\u4e00\u5c64: \u57fa\u672c\u5de5\u5177\u4f7f\u7528\u8207\u898f\u5283: \u80fd\u628a\u591a\u6b65\u9a5f\u4efb\u52d9\u62c6\u89e3\u6210\u5c0f\u76ee\u6a19\u3001\u8fa8\u8b58\u8a72\u7528\u54ea\u500b\u5de5\u5177\u548c\u9806\u5e8f\u3001\u6b63\u78ba\u628a\u8cc7\u8a0a\u5c0d\u61c9\u5230\u5de5\u5177\u53c3\u6578\u3001\u4e00\u6b65\u6b65\u57f7\u884c\u4e0d\u6703\u8dd1\u6389\u3002GPT-4o\u3001Mistral Medium\u3001Nova 1 Pro \u5361\u5728\u9019\u5c64\uff0c\u9023\u57fa\u672c\u7684\u5de5\u5177\u547c\u53eb\u90fd\u6703\u51fa\u932f\uff0c\u4f8b\u5982\u628a &#8220;gold&#8221; \u7576\u6210\u5ba2\u6236 ID \u50b3\u9032\u53bb\u3002<\/p>\n\n\n\n<p>\u7b2c\u4e8c\u5c64: \u9069\u61c9\u529b: \u8a08\u756b\u78b0\u5230\u73fe\u5be6\u5c31\u5d29\u6f70\u600e\u9ebc\u8fa6? Gemini 2.5 \u548c Qwen3 \u5e38\u57f7\u884c\u6b63\u78ba\u7684\u5de5\u5177\u547c\u53eb\u9806\u5e8f\uff0c\u4f46\u9047\u5230\u554f\u984c\u4e0d\u6703\u8abf\u6574\u3002\u4f8b\u5982\u641c\u5c0b &#8220;Vortex Labs&#8221; \u6c92\u7d50\u679c (\u7cfb\u7d71\u5b58\u7684\u662f &#8220;VortexLabs&#8221; \u6c92\u7a7a\u683c)\uff0c\u5b83\u5011\u5c31\u76f4\u63a5\u56de\u5831\u627e\u4e0d\u5230\uff0c\u800c\u4e0d\u662f\u8a66\u5176\u4ed6\u641c\u5c0b\u65b9\u5f0f\u3002\u76f8\u6bd4\u4e4b\u4e0b\uff0cClaude Sonnet 4.5 \u6703\u4e3b\u52d5\u5617\u8a66\u4e0d\u540c\u7684\u641c\u5c0b\u53c3\u6578\uff0c\u9019\u6b63\u662f\u4eba\u985e\u6703\u505a\u7684\u4e8b\u3002<\/p>\n\n\n\n<p>\u7b2c\u4e09\u5c64: \u63a5\u5730\u80fd\u529b: \u4fdd\u6301\u5728\u7576\u524d\u8108\u7d61\u4e2d\uff0c\u4e0d\u8981\u4e82\u7de8 ID\u3001\u4e0d\u8981\u778e\u63b0\u4e8b\u5be6\u3002Kimi K2 \u6703\u641e\u932f\u5e74\u4efd\uff0c\u7cfb\u7d71\u63d0\u793a\u660e\u660e\u8aaa 2025 \u5e74\uff0c\u5b83\u641c\u5c0b\u6642\u537b\u7528 2024\u3002Claude Sonnet 4.5 \u6709\u6642\u4e5f\u6703\u7de8\u9020 email \u5730\u5740\uff0c\u96d6\u7136\u5b83\u80fd\u81ea\u6211\u4fee\u6b63\uff0c\u4f46\u9019\u7a2e\u812b\u96e2\u73fe\u5be6\u7684\u50be\u5411\u4ee4\u4eba\u64d4\u6182\u3002<\/p>\n\n\n\n<p>\u7b2c\u56db\u5c64: \u5e38\u8b58\u63a8\u7406: \u9019\u662f\u5206\u9694 GPT-5 \u548c\u4eba\u985e\u6c34\u6e96\u7684\u95dc\u9375\u3002\u5ba2\u6236\u8aaa\u300c\u5305\u88f9\u5e7e\u5c0f\u6642\u524d\u5230\u4e86\u300d\u8981\u6c42\u9000\u6b3e\uff0c\u9019\u660e\u986f\u662f\u9000\u8ca8\u4e0d\u662f\u53d6\u6d88\u8a02\u55ae (\u56e0\u70ba\u5df2\u7d93\u6536\u5230\u5546\u54c1\u4e86)\uff0c\u4f46 GPT-5 \u6c92\u63a8\u7406\u51fa\u4f86\u3002\u53e6\u4e00\u500b\u4f8b\u5b50\u662f\u627e\u300c\u904a\u6232\u73a9\u5bb6\u300d\u5ba2\u6236\uff0c\u5408\u7406\u505a\u6cd5\u662f\u5148\u627e\u904a\u6232\u76f8\u95dc\u7522\u54c1\u985e\u5225\u518d\u641c\u5c0b\u8a02\u55ae\uff0c\u4f46 GPT-5 \u537b\u7b28\u62d9\u5730\u9010\u65e5\u641c\u5c0b\u6574\u500b\u6708\u7684\u8a02\u55ae\u3002<\/p>\n\n\n\n<p>\u7d50\u8ad6: 2025 \u5e74\u4e0d\u662f\u300c\u6211\u5011\u5df2\u7d93\u5be6\u73fe\u5f37\u5927\u901a\u7528 agent\u300d\u7684\u4e00\u5e74\uff0c\u800c\u662f\u300cagent \u7d42\u65bc\u80fd\u5920\u7a69\u5b9a\u884c\u52d5\uff0c\u6211\u5011\u53ef\u4ee5\u958b\u59cb\u8a0e\u8ad6\u5206\u6790\u5b83\u5011\u7684\u5e38\u8b58\u63a8\u7406\u80fd\u529b\u300d\u7684\u4e00\u5e74\u3002<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>\ud83d\udd39 \u89e3\u65b9: Agent \u61c9\u8a72\u66f4\u6709\u4e3b\u898b<\/strong><\/p>\n\n\n\n<p><a href=\"https:\/\/www.vtrivedy.com\/posts\/agents-should-be-more-opinionated\">Agents Should Be More Opinionated<\/a><\/p>\n\n\n\n<p>\u9762\u5c0d\u9019\u4e9b\u6311\u6230\uff0c\u6709\u4e00\u500b\u7522\u54c1\u958b\u767c\u65b9\u5411\u6211\u5f88\u8a8d\u540c: \u6700\u597d\u7684 agent \u7522\u54c1\u4e0d\u662f\u6700\u6709\u5f48\u6027\u7684\uff0c\u800c\u662f\u6700\u6709\u4e3b\u898b (Opinionated) \u7684\u3002<\/p>\n\n\n\n<p>\u5f48\u6027\u9677\u9631: \u4ec0\u9ebc\u7528\u6236\u6703\u8208\u596e\u5730\u81ea\u5df1\u8abf\u6574\u6a21\u578b\u6eab\u5ea6\u548c\u5206\u584a\u7b56\u7565? \u6c92\u6709\u3002\u4ee5\u70ba\u7528\u6236\u60f3\u8981\u9078\u64c7\uff0c\u5176\u5be6\u4ed6\u5011\u60f3\u8981\u7d50\u679c\u3002Steve Jobs \u548c iPhone \u5c31\u662f\u6700\u597d\u7684\u4f8b\u5b50: \u4e00\u500b\u6309\u9215\u3001\u4e00\u500b\u87a2\u5e55\uff0c\u4f46\u529f\u80fd\u6c92\u6709\u4efb\u4f55\u9650\u5236\uff0c\u9b54\u6cd5\u5728\u65bc\u7522\u54c1\u5f9e\u5c11\u6578\u4e92\u52d5\u9ede\u5c31\u80fd\u53ef\u9760\u904b\u4f5c\u3002<\/p>\n\n\n\n<p>\u66ff\u7528\u6236\u505a\u5927\u91cf\u524d\u7f6e\u5de5\u4f5c:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>\u6e2c\u8a66\u6bcf\u500b\u6a21\u578b\uff0c\u6240\u4ee5\u7528\u6236\u4e0d\u7528\u6e2c (\u4e0d\u8981\u76f8\u4fe1 benchmark\uff0c\u8981\u5728\u4f60\u7684\u771f\u5be6\u5834\u666f\u6e2c\u8a66)<\/li>\n\n\n\n<li>\u5beb\u8a73\u7d30\u7684 prompt \u544a\u8a34 agent \u6210\u529f\u9577\u4ec0\u9ebc\u6a23\u3001\u600e\u9ebc\u9054\u6210<\/li>\n\n\n\n<li>\u6bcf\u500b\u5fc5\u586b\u7684\u7528\u6236\u9078\u9805\uff0c\u90fd\u4ee3\u8868\u4f60\u6c92\u66ff\u7528\u6236\u505a\u597d\u6c7a\u5b9a<\/li>\n<\/ul>\n\n\n\n<p>\u6a21\u578b\u5728\u6846\u67b6\u88e1\u662f\u4e0d\u53ef\u66ff\u63db\u7684: \u4f60\u6c92\u8fa6\u6cd5\u812b\u96e2\u6846\u67b6\u4f86\u8a55\u4f30\u6a21\u578b\u3002\u6a21\u578b\u667a\u529b\u662f\u300c\u5c16\u523a\u72c0\u300d\u7684\uff0c\u7576\u4f60\u8a2d\u8a08\u6846\u67b6\u6642\uff0c\u4f60\u96b1\u542b\u5730\u5728\u7e5e\u904e\u6a21\u578b\u7684\u5f37\u9805\u548c\u5f31\u9805\u8a2d\u8a08\u3002\u6240\u4ee5\u300c\u5347\u7d1a\u300d\u5230\u65b0\u6a21\u578b\u5e38\u5e38\u6703\u6253\u58de\u73fe\u6709\u6846\u67b6\u3002\u552f\u4e00\u91cd\u8981\u7684\u554f\u984c\u662f: \u9019\u500b\u6846\u67b6 + \u6a21\u578b\u7d44\u5408\uff0c\u5728\u6211\u7684\u4efb\u52d9\u4e0a\u6210\u529f\u55ce?<\/p>\n\n\n\n<p>\u5f9e\u6df1\u4e14\u7a84\u958b\u59cb: \u5bec\u6cdb\u7684 agent \u60f3\u8655\u7406\u592a\u591a\u7a2e\u4efb\u52d9\uff0cdemo \u5f88\u53b2\u5bb3\u4f46\u6b63\u5f0f\u74b0\u5883\u5f88\u6158\uff0c\u56e0\u70ba\u6bcf\u591a\u4e00\u500b\u529f\u80fd\u5c31\u591a\u4e00\u5806 bug \u548c\u908a\u754c\u60c5\u6cc1\u3002\u6dfa\u8584\u7684 agent \u53c8\u4e0d\u5920\u8907\u96dc\uff0c\u6839\u672c\u4e0d\u8a72\u662f agent\u3002\u751c\u871c\u9ede\u662f\u5920\u7a84\u53ef\u4ee5\u5fb9\u5e95\u512a\u5316\uff0c\u53c8\u5920\u6df1\u8b93\u8907\u96dc\u5ea6\u503c\u5f97\u6295\u8cc7\u3002\u5148\u627e\u51fa\u90a3 10% \u80fd\u7522\u751f\u6700\u5927\u50f9\u503c\u7684\u4efb\u52d9\u4f86\u505a agent\uff0c\u5ffd\u7565\u5176\u4ed6\u7684\u3002<\/p>\n\n\n\n<p>\u9023 Anthropic \u90fd\u5728\u8b8a\u5f97\u66f4\u6709\u4e3b\u898b: \u4ed6\u5011\u6709\u5c08\u9580\u7684\u751f\u547d\u79d1\u5b78\u548c\u91d1\u878d\u5718\u968a\uff0c\u4e0d\u662f\u70ba\u4e86\u505a\u5c08\u9580\u7684\u57fa\u790e\u6a21\u578b\uff0c\u800c\u662f\u5728\u6df1\u8015\u554f\u984c\u9818\u57df\u3001\u512a\u5316 agent \u6846\u67b6 (prompts\u3001\u5de5\u5177\u3001context\u3001sub-agent)\u3002Claude Code \u548c Codex \u9019\u4e9b\u7522\u54c1\u4e5f\u90fd\u6709\u5167\u5efa\u7684\u5de5\u5177\u548c context \u7ba1\u7406\uff0c\u800c\u4e0d\u662f\u7d66\u4f60\u4e00\u5806\u9078\u9805\u3002<\/p>\n\n\n\n<p>&#8212;<\/p>\n\n\n\n<p>\u4ee5\u4e0a\u56db\u7bc7\u6587\u7ae0\u5206\u4eab\uff0c\u7b97\u662f 2025 \u6b72\u672b AI Agent \u958b\u767c\u7684\u73fe\u6cc1\u3002\u4f7f\u7528 Claude Code\u3001Codex\u3001Cursor \u9019\u4e9b Coding Agent \u4f86\u5beb code \u78ba\u5be6\u5f88\u723d\uff0c\u4f46\u5225\u5fd8\u4e86\u9019\u4e9b\u662f\u76ee\u524d\u6700\u5f37\u7684 AI \u516c\u53f8\u50be\u529b\u6253\u9020\u7684\u7522\u54c1\uff0c\u800c\u8981\u6211\u5011\u81ea\u5df1\u8981\u958b\u767c Agent \u7684\u6642\u5019\uff0c\u6311\u6230\u624d\u6b63\u8981\u958b\u59cb\u3002<br><br><\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n","protected":false},"excerpt":{"rendered":"<p>\u5728\u8b1b\u5b8c WebConf \u4e4b\u5f8c\uff0c\u6211\u6709\u7a2e\u83ab\u540d\u7684\u4e0d\u5354\u8abf\u611f: \u4e00\u65b9\u9762 Vibe Coding \u8b93\u5927\u5bb6\u5beb\u7a0b\u5f0f\u8b8a\u7c21\u55ae\u4e86\uff0c\u4eba &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/ihower.tw\/blog\/13513-agent-design-is-still-hard-2025\" class=\"more-link\">\u95b1\u8b80\u5168\u6587<span class=\"screen-reader-text\">\u3008AI Agent \u7522\u54c1\u958b\u767c\u4ecd\u7136\u4e0d\u7c21\u55ae\u3009<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[80],"tags":[],"class_list":["post-13513","post","type-post","status-publish","format-standard","hentry","category-llm","entry"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p1q6tG-3vX","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/posts\/13513","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/comments?post=13513"}],"version-history":[{"count":26,"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/posts\/13513\/revisions"}],"predecessor-version":[{"id":13563,"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/posts\/13513\/revisions\/13563"}],"wp:attachment":[{"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/media?parent=13513"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/categories?post=13513"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ihower.tw\/blog\/wp-json\/wp\/v2\/tags?post=13513"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}