抢先OpenAI!Hume AI发布第二代情感智能AI,支持自定义语音
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p26-sign.toutiaoimg.com/tos-cn-i-axegupay5k/494375738cc9425dbe03493a2defd603~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1727626226&x-signature=Eo70sJ1SSsrNilgRoA5r7WAQUBk%3D" style="width: 50%; margin-bottom: 20px;"></div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><strong style="color: blue;"><span style="color: black;">编译 | Vendii</span></strong><strong style="color: blue;"><span style="color: black;">编辑 | 漠影</span></strong></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">智东西9月19日<span style="color: black;">信息</span>,据VentureBeat今日<span style="color: black;">报告</span>,AI情感创企Hume AI于9月11日发布了Empathic Voice Interface 2(EVI 2)。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">EVI被宣<span style="color: black;">叫作</span>为<span style="color: black;">全世界</span>首个<span style="color: black;">拥有</span>情商的对话式AI。EVI能够<span style="color: black;">经过</span>分析用户的语音,如口音、语气、语调、拟声词、节奏和停顿等,来理解用户的<span style="color: black;">心情</span>和心理状态,并做出实时响应。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">与EVI 1相比,新发布的EVI 2的响应延迟减少了40%,且成本降低了30%。<span style="color: black;">另外</span>,新一代EVI还进行了一系列功能<span style="color: black;">加强</span>与更新:语音质量的<span style="color: black;">加强</span>,情商与同理心的<span style="color: black;">加强</span>,支持自定义语音……</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">Hume AI由前谷歌DeepMind<span style="color: black;">科研</span>员Alan Cowen于2021年创立,他<span style="color: black;">此刻</span>担任该<span style="color: black;">机构</span>的首席执行官兼首席<span style="color: black;">专家</span>。该<span style="color: black;">机构</span>于今年3月27日完<span style="color: black;">成为了</span>5000万美元的B轮融资。</span></span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/tos-cn-i-6w9my0ksvp/fdadc8dd8f164013b53b9f8f96809163~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1727626226&x-signature=L5V4i3aSC%2FVg5Vw5zWQx6ZDI13M%3D" style="width: 50%; margin-bottom: 20px;"></div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">官网<span style="color: black;">位置</span>:https://www.hume.ai/</span></span></p>
<h1 style="color: black; text-align: left; margin-bottom: 10px;"><span style="color: black;">1、</span>功能<span style="color: black;">加强</span>:语音质量和情商的<span style="color: black;">提高</span>,还支持自定义语音</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">EVI 2集<span style="color: black;">成为了</span>一个先进的语音生成模型和情感大型语言模型(eLLM),能够处理和生成文本及音频。这种多模态<span style="color: black;">办法</span>使得EVI 2生成的语音听起来更自然,语调更恰当,表现力更高,输出更连续。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;"><span style="color: black;">另外</span>,在同一模型中处理语音和语言,使得EVI 2<span style="color: black;">能够</span>更好地理解用户输入内容的情感倾向,从而做出相应<span style="color: black;">调节</span>,在内容和语气方面生成更<span style="color: black;">拥有</span>同理心的响应。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">除了在语音质量和情商方面的<span style="color: black;">提高</span>,新一代EVI 2还支持用户自定义语音。<span style="color: black;">研发</span>人员<span style="color: black;">能够</span>设置音调、鼻音和性别等参数,<span style="color: black;">按照</span>特定的应用<span style="color: black;">需要</span>定制EVI 2的语音,<span style="color: black;">例如</span>应用于客服<span style="color: black;">设备</span>人、虚拟AI助手。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">EVI 2还支持用户在交互过程中<span style="color: black;">经过</span>语音提示,动态修改EVI 2的说话风格。例如,“说得更快”、“语调听起来很兴奋”,<span style="color: black;">乃至</span>还<span style="color: black;">能够</span>“进行说唱“。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;"><span style="color: black;">按照</span>Hume AI的介绍,EVI 2还能够与其他应用程序、大语言模型进行集成,在客服通话、网页搜索等功能中<span style="color: black;">运用</span>。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">Cowen在上周与VentureBeat的视频通话中谈道:“<span style="color: black;">咱们</span><span style="color: black;">期盼</span><span style="color: black;">研发</span>者能够将这个模型集成到任何应用中,创建<span style="color: black;">她们</span>想要的品牌语音,并<span style="color: black;">按照</span><span style="color: black;">她们</span>的用户<span style="color: black;">需要</span>进行<span style="color: black;">调节</span>,使其品牌语音变得值得信赖且<span style="color: black;">拥有</span>个性。”</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;"><span style="color: black;">另外</span>,他透露道,EVI 2并不打算<span style="color: black;">供给</span>语音克隆的功能。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">“<span style="color: black;">咱们</span>当然<span style="color: black;">能够</span>用<span style="color: black;">咱们</span>的模型克隆声音,但<span style="color: black;">咱们</span><span style="color: black;">无</span><span style="color: black;">供给</span>这一功能,<span style="color: black;">由于</span>它的<span style="color: black;">危害</span>太高、益处<span style="color: black;">亦</span>不清晰。”他解释道,“人们真正想要的是能够定制声音。<span style="color: black;">咱们</span><span style="color: black;">研发</span>了新的语音,让用户<span style="color: black;">能够</span>创建<span style="color: black;">区别</span>的个性化语音。相比于克隆特定声音,<span style="color: black;">研发</span>者似乎对创建新语音更感兴趣。”</span></span></p>
<div style="color: black; text-align: left; margin-bottom: 10px;"><img src="https://p3-sign.toutiaoimg.com/tos-cn-i-6w9my0ksvp/acd8c02a58a049348418d039d38c6dc5~noop.image?_iz=58558&from=article.pc_detail&lk3s=953192f4&x-expires=1727626226&x-signature=mYssBMvFO8YHP36ZweL%2FG0CJALQ%3D" style="width: 50%; margin-bottom: 20px;"></div>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;">定制语音功能体验<span style="color: black;">位置</span>:</p>https://platform.hume.ai/evi/voices
<h1 style="color: black; text-align: left; margin-bottom: 10px;"><span style="color: black;">2、</span>性价比<span style="color: black;">加强</span>:响应延迟降低40%,定价降低30%,年底预计能支持<span style="color: black;">更加多</span>语言</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">EVI 2与EVI 1相比,延迟降低了40%,<span style="color: black;">此刻</span>平均响应时间在500到800毫秒之间。速度的改进使对话响应更快、更像人类。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">EVI 2还有一大亮点是其成本效益的<span style="color: black;">加强</span>。Hume AI将EVI 2的定价降低了约30%,从<span style="color: black;">第1</span>代的每分钟0.102美元降低到每分钟0.072美元。企业用户还<span style="color: black;">能够</span>享受批量折扣。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;"><span style="color: black;">不外</span>,<span style="color: black;">按照</span>VentureBeat的计算,OpenAI<span style="color: black;">日前</span><span style="color: black;">供给</span>的文本转语音服务(非新推出的ChatGPT高级语音模式)要比Hume AI的EVI 2便宜<span style="color: black;">非常多</span>。OpenAI的文本转语音服务每1000字符收费0.015美元(大约每分钟语音0.015美元),而Hume AI的EVI 2为每分钟0.072美元。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">EVI 2<span style="color: black;">日前</span>仅支持英语,Hume AI计划在2024年底之前推出对西班牙语、法语和德语等多种语言的支持。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">Cowen向VentureBeat透露道,得益于<span style="color: black;">她们</span>的训练过程,EVI 2<span style="color: black;">实质</span>上自主学习了多种语言,不需要由工程师进行人为的训练。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">“<span style="color: black;">咱们</span><span style="color: black;">无</span>专门训练模型输出某些特定的语言,但它从训练数据中学会了说法语、西班牙语、德语、波兰语等多种语言。”Cowen解释道。</span></span></p>
<h1 style="color: black; text-align: left; margin-bottom: 10px;">结语:先于竞争对手公<span style="color: black;">研发</span>布,有望抢占市场</h1>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">据传,Hume AI潜在的竞争对手Anthropic正在重新打造其投资方亚马逊的Alexa语音助手并准备推出。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">另一方面,OpenAI在今年5月展示的由GPT-4o模型支持的ChatGPT高级语音模式,<span style="color: black;">日前</span>只对<span style="color: black;">少许</span>用户开放,在候补名单中的用户仍需等待。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;">尽管Hume AI并<span style="color: black;">无</span>像OpenAI或Anthropic那样广为人知,但Hume AI<span style="color: black;">已然</span>抢先于它们公开推出了一个人性化语音助手,并且客户<span style="color: black;">此刻</span>就<span style="color: black;">能够</span>立即将其投入<span style="color: black;">运用</span>。这可能为Hume AI在竞争激烈的市场中抢占一席之地。</span></span></p>
<p style="font-size: 16px; color: black; line-height: 40px; text-align: left; margin-bottom: 15px;"><span style="color: black;"><span style="color: black;"><span style="color: black;">源自</span>:VentureBeat</span></span></p>
“沙发”(SF,第一个回帖的人)
页:
[1]