<?xml version="1.0" encoding="utf-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
<title>滑铁卢华人, 水城百事论坛</title>
<link>http://www.kwcg.ca/forum/</link>
<description>水城社区-Kitchener, Waterloo, Cambridge &amp; Guelph华人的网上家园</description>
<language>en</language>
<item>
<title>他们自带2B标签，这是一种冗余</title>
<content:encoded><![CDATA[<p><em>Reply by 匿名, Thursday, July 02, 2026, 10:38:</em></p><p>[ No text ]</p>]]></content:encoded>
<link>http://www.kwcg.ca/forum/index.php?id=1535653</link>
<guid>http://www.kwcg.ca/forum/index.php?id=1535653</guid>
<pubDate>Thu, 02 Jul 2026 10:38:23 GMT</pubDate>
<dc:creator>匿名</dc:creator>
</item>
<item>
<title>千亿级别大模型训练完之后文件尺寸都不小。FP32：≈650GB，FP16：≈325GB，
INT8：≈162GB，INT4：≈81GB。本机装的就几个G的文件，就不要指望有多么好的推理结果了</title>
<content:encoded><![CDATA[<p><em>Reply by 匿名, Thursday, July 02, 2026, 09:47:</em></p><p>[ No text ]</p>]]></content:encoded>
<link>http://www.kwcg.ca/forum/index.php?id=1535651</link>
<guid>http://www.kwcg.ca/forum/index.php?id=1535651</guid>
<pubDate>Thu, 02 Jul 2026 09:47:06 GMT</pubDate>
<dc:creator>匿名</dc:creator>
</item>
<item>
<title>2B都没有。很多明明是hardcode的逻辑，还楞说自己是大模型。</title>
<content:encoded><![CDATA[<p><em>Reply by 匿名, Thursday, July 02, 2026, 09:38:</em></p><p>[ No text ]</p>]]></content:encoded>
<link>http://www.kwcg.ca/forum/index.php?id=1535650</link>
<guid>http://www.kwcg.ca/forum/index.php?id=1535650</guid>
<pubDate>Thu, 02 Jul 2026 09:38:36 GMT</pubDate>
<dc:creator>匿名</dc:creator>
</item>
<item>
<title>这个描述，估计它们脑子处理不了:-D :-D :-D</title>
<content:encoded><![CDATA[<p><em>Reply by 匿名, Thursday, July 02, 2026, 09:19:</em></p><p>[ No text ]</p>]]></content:encoded>
<link>http://www.kwcg.ca/forum/index.php?id=1535649</link>
<guid>http://www.kwcg.ca/forum/index.php?id=1535649</guid>
<pubDate>Thu, 02 Jul 2026 09:19:54 GMT</pubDate>
<dc:creator>匿名</dc:creator>
</item>
<item>
<title>哈哈哈，他们那点脑容量，可能连小模型都不如，不过2B是有的</title>
<content:encoded><![CDATA[<p><em>Reply by 匿名, Thursday, July 02, 2026, 09:06:</em></p><p>[ No text ]</p>]]></content:encoded>
<link>http://www.kwcg.ca/forum/index.php?id=1535647</link>
<guid>http://www.kwcg.ca/forum/index.php?id=1535647</guid>
<pubDate>Thu, 02 Jul 2026 09:06:12 GMT</pubDate>
<dc:creator>匿名</dc:creator>
</item>
<item>
<title>发现很多人其实是本地小模型特点如下：
- 参数量极小：可能是2B
- 训练集极小：甚至大部分都是污染数据
-预训练轮数少：通常不收敛
- 上下文极小：不超过500字
- 注意力：是稀疏的
- 联网搜索：是不会的
- 思维链：是没有的
- 输出：不是幻觉就是过拟合</title>
<content:encoded><![CDATA[<p><em>Posting by 匿名, Thursday, July 02, 2026, 08:11:</em></p><p>[ No text ]</p>]]></content:encoded>
<link>http://www.kwcg.ca/forum/index.php?id=1535641</link>
<guid>http://www.kwcg.ca/forum/index.php?id=1535641</guid>
<pubDate>Thu, 02 Jul 2026 08:11:42 GMT</pubDate>
<dc:creator>匿名</dc:creator>
</item>
</channel>
</rss>