The Audio Transcription API converts spoken words into structured text. It uses speech recognition and artificial intelligence to produce transcriptions for a range of industries and use cases. Whether you are processing live speech or pre-recorded audio files, the API is designed to deliver reliable conversion with minimal errors.
One of the API's most notable features is its multilingual support: it can transcribe audio in several languages with high accuracy, which makes it useful for anyone who works with content in more than one language.
To use this endpoint, pass the URL of an audio file in the `url` parameter.
Get Transcription - Endpoint Features
| Object | Description |
|---|---|
| url | [Required] URL of the audio file to transcribe. |
{"success":true,"audio_file":"https://s17.aconvert.com/convert/p3r68-cdx67/5i1cf-a7awv.mp3","output":{"text":"Football, known as soccer in the United States and Canada, is more than just a sport. It is a global phenomenon that transcends cultural, economic, and geographic barriers, bringing people together in ways few other activities can. With over 3.5 billion fans worldwide, football is often referred to as the beautiful game for its simplicity, inclusiveness, and ability to unite people across different backgrounds. This essay explores","result":{"text":"Football, known as soccer in the United States and Canada, is more than just a sport. It is a global phenomenon that transcends cultural, economic, and geographic barriers, bringing people together in ways few other activities can. With over 3.5 billion fans worldwide, football is often referred to as the beautiful game for its simplicity, inclusiveness, and ability to unite people across different backgrounds. This essay explores","word_count":68,"vtt":"WEBVTT\n\n00.000 --> 01.760\nFootball, known as soccer in\n\n01.760 --> 03.340\nthe United States and Canada,\n\n03.340 --> 04.380\nis more than just a\n\n04.380 --> 05.800\nsport. It is a global\n\n05.800 --> 08.680\nphenomenon that transcends cultural, economic,\n\n08.680 --> 10.720\nand geographic barriers, bringing people\n\n10.720 --> 12.040\ntogether in ways few other\n\n12.040 --> 14.900\nactivities can. 
With over 3\n\n14.900 --> 17.480\n.5 billion fans worldwide, football\n\n17.480 --> 18.740\nis often referred to as\n\n18.740 --> 20.180\nthe beautiful game for its\n\n20.180 --> 22.980\nsimplicity, inclusiveness, and ability to\n\n22.980 --> 25.720\nunite people across different backgrounds.\n\n25.720 --> 26.960\nThis essay explores","words":[{"word":"Football,","start":0,"end":0.6600000262260437},{"word":"known","start":0.6600000262260437,"end":0.8199999928474426},{"word":"as","start":0.8199999928474426,"end":1.0399999618530273},{"word":"soccer","start":1.0399999618530273,"end":1.340000033378601},{"word":"in","start":1.340000033378601,"end":1.7599999904632568},{"word":"the","start":1.7599999904632568,"end":1.8600000143051147},{"word":"United","start":1.8600000143051147,"end":2.0999999046325684},{"word":"States","start":2.0999999046325684,"end":2.440000057220459},{"word":"and","start":2.440000057220459,"end":2.6600000858306885},{"word":"Canada,","start":2.6600000858306885,"end":3.3399999141693115},{"word":"is","start":3.3399999141693115,"end":3.5199999809265137},{"word":"more","start":3.5199999809265137,"end":3.799999952316284},{"word":"than","start":3.799999952316284,"end":4},{"word":"just","start":4,"end":4.199999809265137},{"word":"a","start":4.199999809265137,"end":4.380000114440918},{"word":"sport.","start":4.380000114440918,"end":5.079999923706055},{"word":"It","start":5.079999923706055,"end":5.21999979019165},{"word":"is","start":5.21999979019165,"end":5.320000171661377},{"word":"a","start":5.320000171661377,"end":5.420000076293945},{"word":"global","start":5.420000076293945,"end":5.800000190734863},{"word":"phenomenon","start":5.800000190734863,"end":6.300000190734863},{"word":"that","start":6.300000190734863,"end":6.659999847412109},{"word":"transcends","start":6.659999847412109,"end":7.159999847412109},{"word":"cultural,","start":7.159999847412109,"end":7.980000019073486},{"word":"economic,","start":7.980000019073486,"end":8.680000305175781},{"word":"and"
,"start":8.680000305175781,"end":8.760000228881836},{"word":"geographic","start":8.760000228881836,"end":9.15999984741211},{"word":"barriers,","start":9.15999984741211,"end":10.199999809265137},{"word":"bringing","start":10.199999809265137,"end":10.380000114440918},{"word":"people","start":10.380000114440918,"end":10.720000267028809},{"word":"together","start":10.720000267028809,"end":11.039999961853027},{"word":"in","start":11.039999961853027,"end":11.239999771118164},{"word":"ways","start":11.239999771118164,"end":11.399999618530273},{"word":"few","start":11.399999618530273,"end":11.760000228881836},{"word":"other","start":11.760000228881836,"end":12.039999961853027},{"word":"activities","start":12.039999961853027,"end":12.539999961853027},{"word":"can.","start":12.539999961853027,"end":14.239999771118164},{"word":"With","start":14.239999771118164,"end":14.420000076293945},{"word":"over","start":14.420000076293945,"end":14.619999885559082},{"word":"3","start":14.619999885559082,"end":14.899999618530273},{"word":".5","start":14.899999618530273,"end":15.420000076293945},{"word":"billion","start":15.420000076293945,"end":15.760000228881836},{"word":"fans","start":15.760000228881836,"end":16.040000915527344},{"word":"worldwide,","start":16.040000915527344,"end":17.139999389648438},{"word":"football","start":17.139999389648438,"end":17.479999542236328},{"word":"is","start":17.479999542236328,"end":17.739999771118164},{"word":"often","start":17.739999771118164,"end":18},{"word":"referred","start":18,"end":18.299999237060547},{"word":"to","start":18.299999237060547,"end":18.540000915527344},{"word":"as","start":18.540000915527344,"end":18.739999771118164},{"word":"the","start":18.739999771118164,"end":18.899999618530273},{"word":"beautiful","start":18.899999618530273,"end":19.239999771118164},{"word":"game","start":19.239999771118164,"end":19.68000030517578},{"word":"for","start":19.68000030517578,"end":20.040000915527344},{"word":"its","start":20.040000915527344,"end":2
0.18000030517578},{"word":"simplicity,","start":20.18000030517578,"end":21.1200008392334},{"word":"inclusiveness,","start":21.1200008392334,"end":22.020000457763672},{"word":"and","start":22.020000457763672,"end":22.239999771118164},{"word":"ability","start":22.239999771118164,"end":22.6200008392334},{"word":"to","start":22.6200008392334,"end":22.979999542236328},{"word":"unite","start":22.979999542236328,"end":23.239999771118164},{"word":"people","start":23.239999771118164,"end":23.68000030517578},{"word":"across","start":23.68000030517578,"end":24.040000915527344},{"word":"different","start":24.040000915527344,"end":24.459999084472656},{"word":"backgrounds.","start":24.459999084472656,"end":25.719999313354492},{"word":"This","start":25.719999313354492,"end":26.059999465942383},{"word":"essay","start":26.059999465942383,"end":26.360000610351562},{"word":"explores","start":26.360000610351562,"end":26.959999084472656}]}}}
curl --location --request POST 'https://zylalabs.com/api/6369/audio+transcription+api/9134/get+transcription?url=https://s17.aconvert.com/convert/p3r68-cdx67/5i1cf-a7awv.mp3' --header 'Authorization: Bearer YOUR_API_KEY'
| Header | Description |
|---|---|
| Authorization | [Required] Should be `Bearer access_key`. See "Your API Access Key" above once you are subscribed. |
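The curl call and the two tables above can be mirrored in Python. The following is a minimal sketch using only the standard library; the function name `build_transcription_request` is hypothetical, and it assumes the server accepts a percent-encoded `url` value, as is usual for query strings. It only builds the request and does not send it.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint copied from the curl example above.
ENDPOINT = "https://zylalabs.com/api/6369/audio+transcription+api/9134/get+transcription"


def build_transcription_request(audio_url: str, api_key: str) -> Request:
    """Build (but do not send) the POST request for one audio file.

    Hypothetical helper: the name and signature are illustrative only.
    """
    # urlencode percent-encodes the audio URL, so characters such as "&"
    # or "?" inside it cannot be mistaken for extra query parameters.
    query = urlencode({"url": audio_url})
    return Request(
        f"{ENDPOINT}?{query}",
        method="POST",
        headers={"Authorization": f"Bearer {api_key}"},
    )
```

To send it, pass the returned object to `urllib.request.urlopen` and decode the body with `json.load`. Keeping construction separate from sending makes the URL and header easy to inspect before any quota is spent.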
No long-term commitment. Upgrade, downgrade, or cancel at any time. The Free Trial includes up to 50 requests.
The Get Transcription endpoint returns structured text data derived from the audio input. This includes the transcribed text, timestamps and, potentially, speaker identification, depending on the complexity of the audio. The sample response above contains text and timestamps but no speaker labels.
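In the sample response above, the key fields are `success`, `audio_file`, and `output`. Inside `output`, `text` holds the transcribed text, and `result` holds `text`, `word_count`, `vtt` (the transcript as timed caption cues), and `words` (one entry per word with `start` and `end` times in seconds). The sample does not include separate language or confidence fields.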
The main parameter for the Get Transcription endpoint is `url`, which specifies the location of the audio file to transcribe. Additional parameters may include `language` to specify the desired transcription language; it is not shown in the request example above.
The response data is organized as JSON, with key-value pairs representing the transcription results. This structure makes the results easy to interpret and to integrate into applications.
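As an illustration of reading that JSON structure, here is a minimal sketch that pulls the main values out of an already-decoded response. The function name `summarize_transcription` is hypothetical, and the field paths are taken from the sample response shown earlier, so it assumes other responses follow the same shape.

```python
from typing import Any


def summarize_transcription(response: dict[str, Any]) -> dict[str, Any]:
    """Extract the text, word count, and duration from a decoded response.

    Assumes the layout of the sample response:
    response["output"]["result"] holds "text", "word_count", and "words".
    """
    if not response.get("success"):
        raise ValueError("transcription request did not succeed")
    result = response["output"]["result"]
    words = result.get("words", [])
    return {
        "text": result["text"],
        # Fall back to counting word entries if "word_count" is absent.
        "word_count": result.get("word_count", len(words)),
        # The end time of the last word approximates the spoken duration.
        "duration_seconds": words[-1]["end"] if words else 0.0,
    }
```

Raising on a falsy `success` flag keeps callers from silently processing an error payload; whether the API reports failures this way is an assumption based on the presence of that flag in the sample.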
Typical use cases include transcribing interviews, generating subtitles for videos, creating meeting notes, and converting podcasts to text for accessibility and SEO purposes.
Accuracy is maintained through advanced speech recognition algorithms and continuous training on diverse audio datasets. Regular updates and quality checks help keep transcription fidelity high.
Users can customize their requests by specifying the `language` parameter to transcribe audio in different languages, which makes the API more versatile for multilingual applications.
Standard data patterns include clean, punctuated text output, speaker differentiation in complex audio, and timestamps for each segment, which let users track when specific phrases were spoken.
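To make the timestamp tracking concrete, the following minimal sketch looks up when a phrase was spoken using the per-word `start` and `end` values from the `words` array of the sample response. The helper names are hypothetical, and matching ignores case and surrounding punctuation, which is a design choice of this sketch rather than anything the API does.

```python
import string
from typing import Optional


def _normalize(token: str) -> str:
    """Lowercase a token and strip punctuation from both ends."""
    return token.strip(string.punctuation).lower()


def find_phrase(words: list[dict], phrase: str) -> Optional[tuple[float, float]]:
    """Return (start, end) in seconds of the first match, or None.

    `words` is expected to look like output.result.words in the sample:
    a list of {"word": ..., "start": ..., "end": ...} entries.
    """
    target = [_normalize(token) for token in phrase.split()]
    if not target:
        return None
    tokens = [_normalize(entry["word"]) for entry in words]
    # Slide a window of len(target) words across the transcript.
    for i in range(len(tokens) - len(target) + 1):
        if tokens[i:i + len(target)] == target:
            return words[i]["start"], words[i + len(target) - 1]["end"]
    return None
```

For example, with the first few entries of the sample (times rounded here for readability), searching for "known as" returns the start of "known" and the end of "as".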
Service Level:
100%
Response Time:
731ms