音频转录 API 是一种高效且准确的解决方案,旨在将口语转换为结构化文本。它利用先进的语音识别和人工智能技术,提供跨各种行业和应用案例的高质量转录。无论您是在处理实时语音还是预录音频文件,此 API 都能确保以最小的错误实现无瑕疵和可靠的转换
此 API 最突出的一项功能是其多语言支持,允许用户以高准确度转录多种语言的音频。这使其成为需要不同语言转录的用户的宝贵工具
获取转录 - 端点功能
| 对象 | 描述 |
|---|---|
url |
[必需] Indicates a URL |
{"success":true,"audio_file":"https://s17.aconvert.com/convert/p3r68-cdx67/5i1cf-a7awv.mp3","output":{"text":"Football, known as soccer in the United States and Canada, is more than just a sport. It is a global phenomenon that transcends cultural, economic, and geographic barriers, bringing people together in ways few other activities can. With over 3.5 billion fans worldwide, football is often referred to as the beautiful game for its simplicity, inclusiveness, and ability to unite people across different backgrounds. This essay explores","result":{"text":"Football, known as soccer in the United States and Canada, is more than just a sport. It is a global phenomenon that transcends cultural, economic, and geographic barriers, bringing people together in ways few other activities can. With over 3.5 billion fans worldwide, football is often referred to as the beautiful game for its simplicity, inclusiveness, and ability to unite people across different backgrounds. This essay explores","word_count":68,"vtt":"WEBVTT\n\n00.000 --> 01.760\nFootball, known as soccer in\n\n01.760 --> 03.340\nthe United States and Canada,\n\n03.340 --> 04.380\nis more than just a\n\n04.380 --> 05.800\nsport. It is a global\n\n05.800 --> 08.680\nphenomenon that transcends cultural, economic,\n\n08.680 --> 10.720\nand geographic barriers, bringing people\n\n10.720 --> 12.040\ntogether in ways few other\n\n12.040 --> 14.900\nactivities can. With over 3\n\n14.900 --> 17.480\n.5 billion fans worldwide, football\n\n17.480 --> 18.740\nis often referred to as\n\n18.740 --> 20.180\nthe beautiful game for its\n\n20.180 --> 22.980\nsimplicity, inclusiveness, and ability to\n\n22.980 --> 25.720\nunite people across different backgrounds.\n\n25.720 --> 26.960\nThis essay explores","words":[{"word":"Football,","start":0,"end":0.6600000262260437},{"word":"known","start":0.6600000262260437,"end":0.8199999928474426},{"word":"as","start":0.8199999928474426,"end":1.0399999618530273},{"word":"soccer","start":1.0399999618530273,"end":1.340000033378601},{"word":"in","start":1.340000033378601,"end":1.7599999904632568},{"word":"the","start":1.7599999904632568,"end":1.8600000143051147},{"word":"United","start":1.8600000143051147,"end":2.0999999046325684},{"word":"States","start":2.0999999046325684,"end":2.440000057220459},{"word":"and","start":2.440000057220459,"end":2.6600000858306885},{"word":"Canada,","start":2.6600000858306885,"end":3.3399999141693115},{"word":"is","start":3.3399999141693115,"end":3.5199999809265137},{"word":"more","start":3.5199999809265137,"end":3.799999952316284},{"word":"than","start":3.799999952316284,"end":4},{"word":"just","start":4,"end":4.199999809265137},{"word":"a","start":4.199999809265137,"end":4.380000114440918},{"word":"sport.","start":4.380000114440918,"end":5.079999923706055},{"word":"It","start":5.079999923706055,"end":5.21999979019165},{"word":"is","start":5.21999979019165,"end":5.320000171661377},{"word":"a","start":5.320000171661377,"end":5.420000076293945},{"word":"global","start":5.420000076293945,"end":5.800000190734863},{"word":"phenomenon","start":5.800000190734863,"end":6.300000190734863},{"word":"that","start":6.300000190734863,"end":6.659999847412109},{"word":"transcends","start":6.659999847412109,"end":7.159999847412109},{"word":"cultural,","start":7.159999847412109,"end":7.980000019073486},{"word":"economic,","start":7.980000019073486,"end":8.680000305175781},{"word":"and","start":8.680000305175781,"end":8.760000228881836},{"word":"geographic","start":8.760000228881836,"end":9.15999984741211},{"word":"barriers,","start":9.15999984741211,"end":10.199999809265137},{"word":"bringing","start":10.199999809265137,"end":10.380000114440918},{"word":"people","start":10.380000114440918,"end":10.720000267028809},{"word":"together","start":10.720000267028809,"end":11.039999961853027},{"word":"in","start":11.039999961853027,"end":11.239999771118164},{"word":"ways","start":11.239999771118164,"end":11.399999618530273},{"word":"few","start":11.399999618530273,"end":11.760000228881836},{"word":"other","start":11.760000228881836,"end":12.039999961853027},{"word":"activities","start":12.039999961853027,"end":12.539999961853027},{"word":"can.","start":12.539999961853027,"end":14.239999771118164},{"word":"With","start":14.239999771118164,"end":14.420000076293945},{"word":"over","start":14.420000076293945,"end":14.619999885559082},{"word":"3","start":14.619999885559082,"end":14.899999618530273},{"word":".5","start":14.899999618530273,"end":15.420000076293945},{"word":"billion","start":15.420000076293945,"end":15.760000228881836},{"word":"fans","start":15.760000228881836,"end":16.040000915527344},{"word":"worldwide,","start":16.040000915527344,"end":17.139999389648438},{"word":"football","start":17.139999389648438,"end":17.479999542236328},{"word":"is","start":17.479999542236328,"end":17.739999771118164},{"word":"often","start":17.739999771118164,"end":18},{"word":"referred","start":18,"end":18.299999237060547},{"word":"to","start":18.299999237060547,"end":18.540000915527344},{"word":"as","start":18.540000915527344,"end":18.739999771118164},{"word":"the","start":18.739999771118164,"end":18.899999618530273},{"word":"beautiful","start":18.899999618530273,"end":19.239999771118164},{"word":"game","start":19.239999771118164,"end":19.68000030517578},{"word":"for","start":19.68000030517578,"end":20.040000915527344},{"word":"its","start":20.040000915527344,"end":20.18000030517578},{"word":"simplicity,","start":20.18000030517578,"end":21.1200008392334},{"word":"inclusiveness,","start":21.1200008392334,"end":22.020000457763672},{"word":"and","start":22.020000457763672,"end":22.239999771118164},{"word":"ability","start":22.239999771118164,"end":22.6200008392334},{"word":"to","start":22.6200008392334,"end":22.979999542236328},{"word":"unite","start":22.979999542236328,"end":23.239999771118164},{"word":"people","start":23.239999771118164,"end":23.68000030517578},{"word":"across","start":23.68000030517578,"end":24.040000915527344},{"word":"different","start":24.040000915527344,"end":24.459999084472656},{"word":"backgrounds.","start":24.459999084472656,"end":25.719999313354492},{"word":"This","start":25.719999313354492,"end":26.059999465942383},{"word":"essay","start":26.059999465942383,"end":26.360000610351562},{"word":"explores","start":26.360000610351562,"end":26.959999084472656}]}}}
curl --location --request POST 'https://zylalabs.com/api/6369/audio+transcription+api/9134/get+transcription?url=https://s17.aconvert.com/convert/p3r68-cdx67/5i1cf-a7awv.mp3' --header 'Authorization: Bearer YOUR_API_KEY'
| 标头 | 描述 |
|---|---|
授权
|
[必需] 应为 Bearer access_key. 订阅后,请查看上方的"您的 API 访问密钥"。 |
无长期承诺。随时升级、降级或取消。 免费试用包括最多 50 个请求。
获取转录端点返回源自音频输入的结构化文本数据。这包括转录的文本、时间戳,以及根据音频的复杂性可能包含的说话者识别
响应数据中的关键字段通常包括“转录”(转换后的文本)、“语言”(检测到的语言)和“置信度”(转录的准确度评分)
获取转录端点的主要参数是“audio_url”,它指定要转录的音频文件的位置。附加参数可能包括“language”,以指定所需的转录语言
响应数据以JSON格式组织,键值对表示转录结果。该结构便于解析和集成到应用程序中
典型的使用案例包括转录采访 生成视频字幕 创建会议记录 以及将播客转换为文本以便于无障碍和搜索引擎优化
数据准确性通过先进的语音识别算法和对多样化音频数据集的持续培训得以维护 定期更新和质量检查确保高转录忠实度
用户可以通过指定“语言”参数来自定义他们的请求,以转录不同语言的音频,从而增强API在多语言应用中的灵活性
标准数据模式包括带标点的清晰文本输出 在复杂音频中有说话人区分 并且每个段落都有时间戳 允许用户跟踪特定短语被说出时的时间
服务级别:
100%
响应时间:
731ms
服务级别:
100%
响应时间:
84ms
服务级别:
100%
响应时间:
11,049ms
服务级别:
100%
响应时间:
13,953ms
服务级别:
100%
响应时间:
0ms
服务级别:
100%
响应时间:
64ms
服务级别:
100%
响应时间:
0ms
服务级别:
100%
响应时间:
444ms
服务级别:
91%
响应时间:
3,258ms
服务级别:
100%
响应时间:
1,148ms
服务级别:
100%
响应时间:
451ms
服务级别:
100%
响应时间:
630ms
服务级别:
100%
响应时间:
2,714ms
服务级别:
100%
响应时间:
1,033ms
服务级别:
100%
响应时间:
607ms
服务级别:
100%
响应时间:
3,154ms
服务级别:
100%
响应时间:
273ms
服务级别:
100%
响应时间:
537ms
服务级别:
100%
响应时间:
428ms
服务级别:
100%
响应时间:
134ms