newsmode MarketNews
Прогресс перевода 90 переведено · 2013 в очереди · 238 только что найдено · всего 2341

auto_awesome Applied AI / LLM 658

search
rss_feed Hamel Husain ·05.11.2025

Using LLM-as-a-Judge For Evaluation: A Complete Guide –

descriptionтекст 6270 сл.
rss_feed Hamel Husain ·04.11.2025

Coding Agents – Hamel’s Blog - Hamel Husain

descriptionтекст
rss_feed Hamel Husain ·04.11.2025

Amp – Hamel’s Blog - Hamel Husain

descriptionтекст 867 сл.
rss_feed Ethan Mollick — One Useful Thing ·23.10.2025

Confronting Impossible Futures

descriptionтекст 2383 сл.
rss_feed Ethan Mollick — One Useful Thing ·19.10.2025

An Opinionated Guide to Using AI Right Now

descriptionтекст 2717 сл.
rss_feed Eugene Yan ·19.10.2025

Advice for New Principal Tech ICs (i.e., Notes to Myself)

descriptionтекст 2882 сл.
rss_feed Hamel Husain ·12.10.2025

Q: Can I use the same model for both the main task and evaluation? – Hamel’s Blog - Hamel Husain

descriptionтекст 208 сл.
rss_feed Hamel Husain ·12.10.2025

Your AI Product Needs Evals –

descriptionтекст 3831 сл.
rss_feed Andrej Karpathy — BearBlog ·01.10.2025

Animals vs Ghosts

descriptionтекст 1770 сл.
rss_feed Hamel Husain ·01.10.2025

Selecting The Right AI Evals Tool

descriptionтекст 1316 сл.
rss_feed Ethan Mollick — One Useful Thing ·29.09.2025

Real AI Agents and Real Work

descriptionтекст 1461 сл.
rss_feed Eugene Yan ·14.09.2025

Training an LLM-RecSys Hybrid for Steerable Recs with Semantic IDs

descriptionтекст 4921 сл.
rss_feed Ethan Mollick — One Useful Thing ·11.09.2025

On Working with Wizards

descriptionтекст 2142 сл.
rss_feed Hamel Husain ·11.09.2025

Q: How do we evaluate a model’s ability to express uncertainty or “know what it doesn’t know”? – Hamel’s Blog - Hamel Husain

descriptionтекст 265 сл.
rss_feed Hamel Husain ·11.09.2025

Q: How should I version and manage prompts? – Hamel’s Blog - Hamel Husain

descriptionтекст 370 сл.
rss_feed Ethan Mollick — One Useful Thing ·28.08.2025

Mass Intelligence

descriptionтекст 1807 сл.
rss_feed Hamel Husain ·17.08.2025

Q: How do I make the case for investing in evaluations to my team? – Hamel’s Blog - Hamel Husain

descriptionтекст 242 сл.
rss_feed Hamel Husain ·15.08.2025

(без названия)

scheduleв очереди
rss_feed Hamel Husain ·15.08.2025

(без названия)

scheduleв очереди
rss_feed Hamel Husain ·15.08.2025

(без названия)

scheduleв очереди
rss_feed Hamel Husain ·15.08.2025

Q: Should I build a custom annotation tool or use something off-the-shelf? – Hamel’s Blog - Hamel Husain

descriptionтекст 188 сл.
rss_feed Hamel Husain ·15.08.2025

Q: How do I surface problematic traces for review beyond user feedback? – Hamel’s Blog - Hamel Husain

descriptionтекст 166 сл.
rss_feed Hamel Husain ·15.08.2025

Q: How do I justify evaluation time and budget to management? – Hamel’s Blog - Hamel Husain

descriptionтекст 115 сл.
rss_feed Hamel Husain ·15.08.2025

Q: How do I evaluate complex multi-step workflows? – Hamel’s Blog - Hamel Husain

descriptionтекст 184 сл.