Demystifying the ChatGPT Software Technology Stack
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">ChatGPT has ignited a wave of general-purpose AI. Following the agricultural revolution, the industrial revolution, and the computing revolution, it may well set off an AI revolution.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The industry has focused mostly on ChatGPT's AI algorithms, yet OpenAI has already grown into a platform serving hundreds of millions of users. Over the past three months ChatGPT's availability (SLA) has been about 99%, which means roughly 15 minutes of unavailability per day on average, so its overall technical architecture and reliability are drawing attention as well.</span></p>
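<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The "about 15 minutes per day" figure follows directly from the availability number. The short sketch below shows the arithmetic; the function name and the extra availability levels are illustrative assumptions, not figures published by OpenAI.</span></p>

```python
MINUTES_PER_DAY = 24 * 60


def daily_downtime_minutes(availability: float) -> float:
    """Average minutes of downtime per day implied by an availability ratio."""
    if not 0.0 <= availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    return (1.0 - availability) * MINUTES_PER_DAY


if __name__ == "__main__":
    # 0.99 is the figure quoted in the article; the others are for comparison.
    for availability in (0.99, 0.999, 0.9999):
        minutes = daily_downtime_minutes(availability)
        print(f"{availability:.2%} available -> {minutes:.2f} min/day down")
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">At 99% availability this works out to 14.4 minutes per day, which the article rounds to 15.</span></p>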
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/mmbiz_png/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jicrYxn1xWlOG4Ul7p2IRaic3sRroib8UrmX9yv3x8NSEZvUmUhbYXMfwtg/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">OpenAI status page for the past 90 days (green indicates available; other colors indicate an incident)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">OpenAI has not officially shared ChatGPT's technical architecture, so the full picture cannot be known with 100% accuracy. This article tries to piece it together from the following sources:</span></strong></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Public information on the internet (Twitter, LinkedIn, etc.)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">OpenAI's latest job postings</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Several OpenAI incident reports</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">GitHub code</span></p><span style="color: black;">Based on these sources, the diagram below sketches ChatGPT's main software technology stack.</span><img src="https://mmbiz.qpic.cn/mmbiz_png/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jic8FCgb8g4dHoic93DBiceLQ3eBBjTrp8JU05LQDTaHblgeUdp5tHRdHUg/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;">
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">1. <span style="color: black;">Cloud Services</span></strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">OpenAI is an AI startup of the cloud era, and all of its services run on public cloud. In its early days it was supported by AWS. The conversation below shows the answer given from public information available up to 2021:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/mmbiz_png/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jicwMMtJbfYwMmK8zvycic9jaqz1vMlyVpdekX47GSLM5puBem281jhFdA/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/mmbiz_png/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jiccmIQahibWEgibbBOYOat5VR9W0vqVxZhsibaicuOgfYomQsT9CTqaJaqkA/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The passage above comes from an early description on OpenAI's official website and shows that it once used a large number of AWS services.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">In the past two years OpenAI has received a very large investment from Microsoft, and it has been made clear that Azure will provide its cloud services. It seems safe to say that ChatGPT is moving to a multi-cloud setup centered on Azure, with AWS usage gradually decreasing.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">In addition, it uses Terraform, a multi-cloud infrastructure management tool, to manage its cloud resources.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">2. Databases</strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/mmbiz_png/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jicAC6baz0iccFZBDOGhkG2DpBe5W8AZONjiaiajALicsOkQ2AHQULNbmKW3Q/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Asking ChatGPT itself comes first, but it gives essentially no clear answer. The job postings on the official website and the incident reports, however, provide fairly accurate information:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">ChatGPT's core business data is stored in the relational database PostgreSQL, as described on the official website. The accounts, access keys (AK), conversations, and other data of hundreds of millions of users are all stored there.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The incident report dated February 20, 2023 also states that the primary PostgreSQL database ran into problems, and it mentions that PgBouncer is used for connection pooling.</span></p>
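<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">For readers unfamiliar with connection pooling, the sketch below illustrates the general idea: a fixed number of connections is created once and then lent out and returned, so many callers share a few database connections. It is a toy model in plain Python, not PgBouncer's implementation and not OpenAI's code; <code>FakeConnection</code> and <code>ConnectionPool</code> are hypothetical names introduced here for illustration.</span></p>

```python
import itertools
import queue
from contextlib import contextmanager

_ids = itertools.count(1)


class FakeConnection:
    """Stand-in for a real database connection (hypothetical, illustration only)."""

    def __init__(self):
        self.id = next(_ids)


class ConnectionPool:
    """Lends out a fixed set of connections and takes them back after use."""

    def __init__(self, size: int):
        self._idle = queue.Queue(maxsize=size)
        for _ in range(size):
            self._idle.put(FakeConnection())

    @contextmanager
    def connection(self, timeout: float = 1.0):
        # Blocks until a connection is free; raises queue.Empty on timeout.
        conn = self._idle.get(timeout=timeout)
        try:
            yield conn
        finally:
            # Always return the connection so other callers can reuse it.
            self._idle.put(conn)


if __name__ == "__main__":
    pool = ConnectionPool(size=2)
    with pool.connection() as first:
        print("borrowed connection", first.id)
    with pool.connection() as again:
        print("borrowed connection", again.id)
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">When every connection is in use, a new caller has to wait or fail. A pool therefore protects the database from too many connections, but it can also become a bottleneck itself.</span></p>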
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">A Redis cluster is also used as a cache service. On March 20, 2023 a security flaw in ChatGPT came to light: some users could see other people's chat history. The cause was a bug in redis-py in which responses could get mixed up between requests after a connection's request was cancelled. CEO Sam Altman also apologized on Twitter.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/mmbiz_png/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jicEwY627XqfGdDWUEWhn5TPp5J0ZB3OqyDVMQFgHX14BSzvoQtLQQYibw/640?wx_fmt=png&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
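<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The class of bug behind the Redis incident can be shown with a small simulation. The sketch below is a toy model written for this article, not redis-py code: <code>SharedConnection</code> is a hypothetical connection on which replies come back strictly in order and carry no request identifier. If a request is cancelled after it was sent but before its reply was read, the reply stays on the connection and the next caller receives it.</span></p>

```python
import asyncio
from collections import deque


class SharedConnection:
    """Toy connection: replies arrive in order and carry no request id."""

    def __init__(self):
        self._wire = deque()  # replies the "server" has sent but nobody has read

    async def request(self, command: str) -> str:
        # The command is sent, so the server will produce a reply for it.
        self._wire.append(f"reply to {command}")
        # Simulated network latency; a cancellation can land right here.
        await asyncio.sleep(0.01)
        # Read the next reply on the wire, assuming it belongs to this request.
        return self._wire.popleft()


async def demo() -> str:
    conn = SharedConnection()

    # Alice's request is sent, then cancelled before its reply is read.
    alice = asyncio.create_task(conn.request("GET chats:alice"))
    await asyncio.sleep(0)  # let the task start and send its command
    alice.cancel()
    try:
        await alice
    except asyncio.CancelledError:
        pass

    # The connection is reused without being reset, so Bob reads Alice's reply.
    return await conn.request("GET chats:bob")


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">In this model Bob's request returns the reply meant for Alice. A common way to avoid this kind of mix-up is to discard or reset any connection whose request was interrupted instead of returning it to the pool.</span></p>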
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">In addition, job postings ask for familiarity with CosmosDB, although it is not yet clear what it is used for. CosmosDB is a multi-model database from Azure that offers compatibility APIs for MongoDB, Cassandra, PostgreSQL, Gremlin, and others, and it is Microsoft's flagship product in the NoSQL space.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">Earlier, Patrick McFadin (a Cassandra committer) mentioned on LinkedIn that OpenAI uses Cassandra, but the discussion concluded that this was something ChatGPT itself had made up. It cannot yet be confirmed whether OpenAI really uses Cassandra, or uses the Cassandra API in CosmosDB.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">CosmosDB also offers a distributed PostgreSQL interface through Citus, which Microsoft acquired. It is not confirmed whether Citus is used to implement a distributed database architecture.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">ChatGPT uses Snowflake, a new-generation cloud-native data warehouse, and Tableau for data analysis. Snowflake supports multi-cloud deployment, which is very helpful for migrating the business across clouds from AWS to Azure.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">3. Front End</strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">On the web front end, it is fairly clear that TypeScript and the React framework are used.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">On mobile, ChatGPT has not yet released an official app, but OpenAI is already hiring iOS and Android engineers, so one should arrive soon.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">4. Application and Service Programming Languages</strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">As a technology company built on AI, OpenAI has chosen Python as its core language. Both AI roles and platform software engineering roles require proficiency in Python.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The platform services are written in Python, using the popular Flask framework together with components such as OpenAPI.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">5. AI Frameworks</strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The core AI framework is PyTorch; TensorFlow may also be in use.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">OpenAI developed Triton, a programming framework for GPUs, and open-sourced it on GitHub (https://github.com/openai/triton). It is meant as an alternative to NVIDIA's CUDA, with the goal of making machine learning algorithms more efficient to develop. Triton itself is written in C++ and Python.</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">The finer details of the AI training and deployment architecture remain to be uncovered. The figure below is a typical large-model architecture diagram shared online by Dr. Chen Wei, for reference:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><img src="https://mmbiz.qpic.cn/mmbiz/IicVLYINBJfgzlic0nE6GFEsrQhEGdD7jicWlFxnaF2SvLx0aFibCaXeFjnFMFS676kUdI3YGtRZgTOeuGNKCV8PJQ/640?wx_fmt=jpeg&tp=webp&wxfrom=5&wx_lazy=1&wx_co=1" style="width: 50%; margin-bottom: 20px;"></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">https://zhuanlan.zhihu.com/p/611464068</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">6. Application <span style="color: black;">Deployment</span>, Monitoring, and Operations</strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Container service: </span></strong><span style="color: black;">Kubernetes</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Monitoring and operations: </span></strong><span style="color: black;">Prometheus (job postings for operations engineers mention that knowledge of PromQL is required)</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Logging: </span></strong><span style="color: black;">Splunk</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;"><span style="color: black;">Programming languages: </span></strong><span style="color: black;">Golang, Python</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><strong style="color: blue;">7. References</strong></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">▋</span><span style="color: black;">1. ChatGPT software engineer job description:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">https://openai.com/careers/software-engineer-chatgpt</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">▋</span></strong></span><span style="color: black;">2. OpenAI incident report, February 20, 2023:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">https://status.openai.com/incidents/mq5jgswy45fr</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">▋</span></strong></span><span style="color: black;">3. OpenAI's open-source Triton repository on GitHub:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">https://github.com/openai/triton</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">▋</span></strong>4. Patrick McFadin's LinkedIn discussion on whether ChatGPT uses Cassandra:</span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;">https://www.linkedin.com/posts/patrick-mcfadin-53a8046_this-isnt-your-regular-chatgpt-post-so-activity-7031372446536515584-P8fg/?utm_source=share&utm_medium=member_desktop</span></p>