The HTML Extractor API is an advanced tool designed to facilitate the extraction and analysis of data from web pages by retrieving the full HTML content of those pages. This API is useful for users, who need to access information contained in web sites for various purposes, such as market research, competition monitoring, or web application development.
Main Features:
Full HTML Code Retrieval: The main function of the HTML Extractor API is to capture the complete HTML code of a specific web page. This includes all the structural content of the page, such as tags, attributes and embedded elements. By obtaining the complete HTML, users can have access to all visible and hidden information on the page, allowing for a comprehensive analysis of the content.
Support for Different Types of Web Pages: The API is versatile and supports a wide range of Web sites, from static pages to dynamic sites that generate content using JavaScript. The ability to handle different types of content makes the API suitable for a variety of applications, such as news data collection, social network monitoring, and complex web page structure analysis.
Specific Data Extraction: Although the API provides the full HTML, it can also be used to extract specific page data. Users can combine the API with HTML parsing techniques, such as the use of regular expressions or HTML processing libraries, to extract particular information such as product prices, contact details or any other relevant data.
In summary, the HTML Extractor API is a powerful and flexible tool for extracting HTML content from web pages. It offers an effective solution for those who need full access to web page content for analysis, research or development.Its ability to handle a variety of page types and its easy integration make it a valuable option for numerous use cases in web data management and analysis.
The API receives a URL of a web page and provides the full HTML content of that page for analysis and data extraction.
Competitor Research: Collect content from competitors' websites to analyze prices, products, promotions and marketing strategies.
News Monitoring: Extract content from news sites to keep up with the latest events and updates in real time.
Data Collection for Academic Research: Obtain and analyze content from multiple websites for academic research or case studies.
Web Application Development: Use the API to extract and parse HTML from the web applications themselves during development and testing.
SEO Analysis: Extract HTML from web pages to analyze important SEO elements such as meta tags, headings, and link structure.
Beside the number of API calls per month allowed, there are no other limitations.
To use this endpoint, send an HTTP request with the URL of the desired page and receive the full HTML content of the page.
Source Url - Endpoint Features
Object | Description |
---|---|
urlSupplier |
[Required] String |
forceCache |
[Required] boolean |
{"method":"GET","urlSupplier":"https:\/\/www.reuters.com\/article\/us-usa-economy-idUSKBN2A40BO","redirectedUrlSupplier":"https:\/\/www.reuters.com\/article\/us-usa-economy-idUSKBN2A40BO\/","pageSource":"\u003Chtml lang=\u0022ja\u0022 data-layout=\u0022regular-article\u0022\u003E\u003Chead\u003E\u003Ctitle\u003E\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98 | \u30ed\u30a4\u30bf\u30fc\u003C\/title\u003E\u003Cmeta name=\u0022viewport\u0022 content=\u0022width=device-width, initial-scale=1\u0022\u003E\u003Cmeta name=\u0022apple-itunes-app\u0022 content=\u0022app-id=602660809, app-argument=https:\/\/www.reuters.com\/article\/opinion\/-idUSKBN2A40BN\/?id=KBN2A40BO\u0022\u003E\u003Cscript async=\u0022\u0022 src=\u0022https:\/\/js.datadome.co\/tags.js\u0022\u003E\u003C\/script\u003E\u003Cscript src=\u0022https:\/\/geolocation.onetrust.com\/cookieconsentpub\/v1\/geo\/location\/dnsfeed\u0022 async=\u0022\u0022 type=\u0022text\/javascript\u0022\u003E\u003C\/script\u003E\u003Cscript src=\u0022\/\/tru.am\/scripts\/ta-pagesocial-sdk.js\u0022\u003E\u003C\/script\u003E\u003Cscript type=\u0022text\/javascript\u0022 async=\u0022\u0022 src=\u0022\/\/img.en25.com\/i\/elqCfg.min.js\u0022\u003E\u003C\/script\u003E\u003Cscript type=\u0022text\/javascript\u0022 async=\u0022\u0022 src=\u0022https:\/\/cdn.segment.com\/analytics.js\/v1\/IEWBqQ8VWHijTQxb7lEBGFGS9uIJzigZ\/analytics.min.js\u0022\u003E\u003C\/script\u003E\u003Cscript async=\u0022\u0022 src=\u0022https:\/\/functionalfeather.com\/chunks\/4f0840710d72b\/1e91cf83e04ef163910e05eef.main.js\u0022\u003E\u003C\/script\u003E\u003Cscript async=\u0022\u0022 src=\u0022https:\/\/www.googletagmanager.com\/gtm.js?id=GTM-K5WTBZN\u0022\u003E\u003C\/script\u003E\u003Cscript\u003E(function(){\n var current_location = window.location.href;\n\n if (current_location.indexOf(\u0027\/info-pages\/supported-browsers\/\u0027) === -1) {\n var supportFetchApi = \u0027fetch\u0027 in window;\n var supportCSSGrid = window.CSS \u0026\u0026 CSS.supports(\u0027display\u0027, \u0027grid\u0027);\n\n if (!supportFetchApi \u0026\u0026 !supportCSSGrid) {\n window.location.href = \u0027\/info-pages\/supported-browsers\/\u0027;\n }\n }\n })()\u003C\/script\u003E\u003Cscript src=\u0022\/pf\/resources\/dist\/reuters\/js\/index.js?d=216\u0022 async=\u0022\u0022 data-config=\u0022{\u0026quot;API_ORIGIN\u0026quot;:\u0026quot;https:\/\/api-reuters-reuters-prod.cdn.arcpublishing.com\u0026quot;,\u0026quot;ADMIN\u0026quot;:false,\u0026quot;BUSINESS_COUNTRIES\u0026quot;:[],\u0026quot;CHARTBEAT_CONFIG\u0026quot;:{\u0026quot;title\u0026quot;:\u0026quot;\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98\u0026quot;,\u0026quot;authors\u0026quot;:\u0026quot;Reuters\u0026quot;,\u0026quot;type\u0026quot;:\u0026quot;regular-article\u0026quot;,\u0026quot;domain\u0026quot;:\u0026quot;reuters.com\u0026quot;},\u0026quot;DEPLOYMENT\u0026quot;:\u0026quot;216\u0026quot;,\u0026quot;ELOQUA_SITE_ID\u0026quot;:\u0026quot;2124157686\u0026quot;,\u0026quot;MAX_NUMBER_OF_UNICODE_CHARS\u0026quot;:256,\u0026quot;ONETRUST_SCRIPT_ID\u0026quot;:\u0026quot;38cb75bd-fbe1-4ac8-b4af-e531ab368caf\u0026quot;,\u0026quot;PARSELY_SITE_ID\u0026quot;:\u0026quot;reuters.com\u0026quot;,\u0026quot;SEGMENT_WRITE_KEY\u0026quot;:\u0026quot;IEWBqQ8VWHijTQxb7lEBGFGS9uIJzigZ\u0026quot;,\u0026quot;SEGMENT_WRITE_KEY_MOBILE\u0026quot;:\u0026quot;YlmAIaFBxsNtlVJdfuSV0ncE931ghRtS\u0026quot;,\u0026quot;GRAPHICS_PLUGIN_IFRAME_URL\u0026quot;:\u0026quot;https:\/\/sphinx.thomsonreuters.com\/search\/?consumer=PageBuilder#\/search\/graphic\u0026quot;,\u0026quot;COMSCORE_CLIENT_ID\u0026quot;:\u0026quot;37296053\u0026quot;,\u0026quot;PAGE_CATEGORY\u0026quot;:\u0026quot;\u0026quot;,\u0026quot;PAGE_TYPE\u0026quot;:\u0026quot;\u0026quot;}\u0022\u003E\u003C\/script\u003E\u003Cscript src=\u0022https:\/\/www.reuters.com\/arc\/subs\/p.min.js\u0022 async=\u0022\u0022\u003E\u003C\/script\u003E\u003Cmeta property=\u0022fb:app_id\u0022 content=\u0022988502044532272\u0022\u003E\u003Cmeta property=\u0022fb:pages\u0022 content=\u0022114050161948682\u0022\u003E\u003Cmeta name=\u0022robots\u0022 content=\u0022noarchive, max-image-preview:large\u0022\u003E\u003Cmeta name=\u0022CCBot\u0022 content=\u0022nofollow\u0022\u003E\u003Clink href=\u0022https:\/\/www.googletagmanager.com\u0022 rel=\u0022preconnect\u0022\u003E\u003Clink href=\u0022https:\/\/connect.facebook.net\u0022 rel=\u0022preconnect\u0022\u003E\u003Cmeta property=\u0022og:title\u0022 content=\u0022\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98\u0022\u003E\u003Cmeta property=\u0022og:type\u0022 content=\u0022article\u0022\u003E\u003Cmeta property=\u0022og:image\u0022 content=\u0022https:\/\/www.reuters.com\/pf\/resources\/images\/reuters\/reuters-default.webp?d=216\u0022\u003E\u003Cmeta property=\u0022og:url\u0022 content=\u0022https:\/\/www.reuters.com\/article\/opinion\/-idUSKBN2A40BN\/\u0022\u003E\u003Cmeta property=\u0022og:description\u0022 content=\u0022\u52a0\u85e4\u52dd\u4fe1\u5b98\u623f\u9577\u5b98\u306f\uff14\u65e5\u5348\u524d\u306e\u4f1a\u898b\u3067\u3001\u6771\u4eac\u4e94\u8f2a\u30fb\u30d1\u30e9\u30ea\u30f3\u30d4\u30c3\u30af\u7d44\u7e54\u59d4\u54e1\u4f1a\u306e\u68ee\u559c\u6717\u4f1a\u9577\u306e\u767a\u8a00\u304c\u5973\u6027\u8511\u8996\u3068\u6279\u5224\u3055\u308c\u3001\u8f9e\u4efb\u3092\u8981\u6c42\u3059\u308b\u58f0\u3082\u3042\u308b\u3053\u3068\u306b\u3064\u3044\u3066\u3001\u653f\u5e9c\u3068\u3057\u3066\u30b3\u30e1\u30f3\u30c8\u3057\u306a\u3044\u3068\u3057\u305f\u4e0a\u3067\u3001\u307e\u305a\u306f\u4e94\u8f2a\u7d44\u7e54\u59d4\u54e1\u4f1a\u3067\u306e\u5bfe\u5fdc\u304c\u57fa\u672c\u3060\u3068\u8ff0\u3079\u305f\u3002\u0022\u003E\u003Cmeta property=\u0022og:locale\u0022 content=\u0022en_US\u0022\u003E\u003Cmeta property=\u0022og:site_name\u0022 content=\u0022Reuters\u0022\u003E\u003Cmeta name=\u0022article:published_time\u0022 content=\u00222021-02-04T03:22:13.000Z\u0022\u003E\u003Cmeta name=\u0022article:modified_time\u0022 content=\u00222021-02-04T03:17:14.000Z\u0022\u003E\u003Cmeta name=\u0022article:author\u0022 content=\u0022Reuters\u0022\u003E\u003Cmeta name=\u0022article:tag\u0022 content=\u0022ASIA,ASXPAC,COMDIS,COVID,EASIA,ECON,GEN,GENHLT,HEA,HUMDIS,INFDIS,JDOM,JLN,JP,MOF,OLY,PLCY,POL,PUBHEA,SPO,WOM\u0022\u003E\u003Cmeta property=\u0022og:image:url\u0022 content=\u0022https:\/\/www.reuters.com\/pf\/resources\/images\/reuters\/reuters-default.webp?d=216\u0022\u003E\u003Cmeta property=\u0022og:image:width\u0022 content=\u00221200\u0022\u003E\u003Cmeta property=\u0022og:image:height\u0022 content=\u0022628\u0022\u003E\u003Cmeta property=\u0022og:image:alt\u0022 content=\u0022Reuters logo\u0022\u003E\u003Cmeta name=\u0022twitter:card\u0022 content=\u0022summary_large_image\u0022\u003E\u003Cmeta name=\u0022twitter:site\u0022 content=\u0022@Reuters\u0022\u003E\u003Cmeta name=\u0022twitter:creator\u0022 content=\u0022@Reuters\u0022\u003E\u003Cmeta name=\u0022twitter:description\u0022 content=\u0022\u52a0\u85e4\u52dd\u4fe1\u5b98\u623f\u9577\u5b98\u306f\uff14\u65e5\u5348\u524d\u306e\u4f1a\u898b\u3067\u3001\u6771\u4eac\u4e94\u8f2a\u30fb\u30d1\u30e9\u30ea\u30f3\u30d4\u30c3\u30af\u7d44\u7e54\u59d4\u54e1\u4f1a\u306e\u68ee\u559c\u6717\u4f1a\u9577\u306e\u767a\u8a00\u304c\u5973\u6027\u8511\u8996\u3068\u6279\u5224\u3055\u308c\u3001\u8f9e\u4efb\u3092\u8981\u6c42\u3059\u308b\u58f0\u3082\u3042\u308b\u3053\u3068\u306b\u3064\u3044\u3066\u3001\u653f\u5e9c\u3068\u3057\u3066\u30b3\u30e1\u30f3\u30c8\u3057\u306a\u3044\u3068\u3057\u305f\u4e0a\u3067\u3001\u307e\u305a\u306f\u4e94\u8f2a\u7d44\u7e54\u59d4\u54e1\u4f1a\u3067\u306e\u5bfe\u5fdc\u304c\u57fa\u672c\u3060\u3068\u8ff0\u3079\u305f\u3002\u0022\u003E\u003Cmeta name=\u0022twitter:title\u0022 content=\u0022\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98\u0022\u003E\u003Cmeta name=\u0022twitter:image\u0022 content=\u0022https:\/\/www.reuters.com\/pf\/resources\/images\/reuters\/reuters-default.webp?d=216\u0022\u003E\u003Cmeta name=\u0022twitter:image:alt\u0022 content=\u0022Reuters logo\u0022\u003E\u003Cmeta name=\u0022keywords\u0022 content=\u0022ASIA,ASXPAC,COMDIS,COVID,EASIA,ECON,GEN,GENHLT,HEA,HUMDIS,INFDIS,JDOM,JLN,JP,MOF,OLY,PLCY,POL,PUBHEA,SPO,WOM\u0022\u003E\u003Cmeta name=\u0022description\u0022 content=\u0022\u52a0\u85e4\u52dd\u4fe1\u5b98\u623f\u9577\u5b98\u306f\uff14\u65e5\u5348\u524d\u306e\u4f1a\u898b\u3067\u3001\u6771\u4eac\u4e94\u8f2a\u30fb\u30d1\u30e9\u30ea\u30f3\u30d4\u30c3\u30af\u7d44\u7e54\u59d4\u54e1\u4f1a\u306e\u68ee\u559c\u6717\u4f1a\u9577\u306e\u767a\u8a00\u304c\u5973\u6027\u8511\u8996\u3068\u6279\u5224\u3055\u308c\u3001\u8f9e\u4efb\u3092\u8981\u6c42\u3059\u308b\u58f0\u3082\u3042\u308b\u3053\u3068\u306b\u3064\u3044\u3066\u3001\u653f\u5e9c\u3068\u3057\u3066\u30b3\u30e1\u30f3\u30c8\u3057\u306a\u3044\u3068\u3057\u305f\u4e0a\u3067\u3001\u307e\u305a\u306f\u4e94\u8f2a\u7d44\u7e54\u59d4\u54e1\u4f1a\u3067\u306e\u5bfe\u5fdc\u304c\u57fa\u672c\u3060\u3068\u8ff0\u3079\u305f\u3002\u0022\u003E\u003Cmeta property=\u0022fb:admins\u0022 content=\u0022988502044532272\u0022\u003E\u003Cmeta property=\u0022og:locale:alternate\u0022 content=\u0022en_US\u0022\u003E\u003Cmeta name=\u0022DCSext.DartZone\u0022 content=\u0022\/4735792\/reuters.com\/opinion\/article\u0022\u003E\u003Cmeta property=\u0022og:article:modified_time\u0022 content=\u00222021-02-04T03:17:14.000Z\u0022\u003E\u003Cmeta property=\u0022og:updated_time\u0022 content=\u00222021-02-04T03:17:14.000Z\u0022\u003E\u003Cmeta property=\u0022og:article:published_time\u0022 content=\u00222021-0...
curl --location --request GET 'https://zylalabs.com/api/5079/html+extractor+api/6470/source+url?urlSupplier=https://www.reuters.com/article/us-usa-economy-idUSKBN2A40BO&forceCache=True' --header 'Authorization: Bearer YOUR_API_KEY'
Header | Description |
---|---|
Authorization
|
[Required] Should be Bearer access_key . See "Your API Access Key" above when you are subscribed. |
No long term commitments. One click upgrade/downgrade or cancellation. No questions asked.
To use this API, you send a request with the URL of the web page and receive the full HTML content for parsing and extraction.
The HTML Extractor API fetches the complete HTML code from a web page, making it easy to parse and extract data from the content.
There are different plans suits everyone including a free trial for small amount of requests, but itβs rate is limit to prevent abuse of the service.
Zyla provides a wide range of integration methods for almost all programming languages. You can use these codes to integrate with your project as you need.
The API returns detailed information about the age and history of a domain, including years, months and days since its creation, as well as expiration and update dates.
Zyla API Hub is like a big store for APIs, where you can find thousands of them all in one place. We also offer dedicated support and real-time monitoring of all APIs. Once you sign up, you can pick and choose which APIs you want to use. Just remember, each API needs its own subscription. But if you subscribe to multiple ones, you'll use the same key for all of them, making things easier for you.
Prices are listed in USD (United States Dollar), EUR (Euro), CAD (Canadian Dollar), AUD (Australian Dollar), and GBP (British Pound). We accept all major debit and credit cards. Our payment system uses the latest security technology and is powered by Stripe, one of the worldβs most reliable payment companies. If you have any trouble paying by card, just contact us at [email protected]
Additionally, if you already have an active subscription in any of these currencies (USD, EUR, CAD, AUD, GBP), that currency will remain for subsequent subscriptions. You can change the currency at any time as long as you don't have any active subscriptions.
The local currency shown on the pricing page is based on the country of your IP address and is provided for reference only. The actual prices are in USD (United States Dollar). When you make a payment, the charge will appear on your card statement in USD, even if you see the equivalent amount in your local currency on our website. This means you cannot pay directly with your local currency.
Occasionally, a bank may decline the charge due to its fraud protection settings. We suggest reaching out to your bank initially to check if they are blocking our charges. Also, you can access the Billing Portal and change the card associated to make the payment. If these does not work and you need further assistance, please contact our team at [email protected]
Prices are determined by a recurring monthly or yearly subscription, depending on the chosen plan.
API calls are deducted from your plan based on successful requests. Each plan comes with a specific number of calls that you can make per month. Only successful calls, indicated by a Status 200 response, will be counted against your total. This ensures that failed or incomplete requests do not impact your monthly quota.
Zyla API Hub works on a recurring monthly subscription system. Your billing cycle will start the day you purchase one of the paid plans, and it will renew the same day of the next month. So be aware to cancel your subscription beforehand if you want to avoid future charges.
To upgrade your current subscription plan, simply go to the pricing page of the API and select the plan you want to upgrade to. The upgrade will be instant, allowing you to immediately enjoy the features of the new plan. Please note that any remaining calls from your previous plan will not be carried over to the new plan, so be aware of this when upgrading. You will be charged the full amount of the new plan.
To check how many API calls you have left for the current month, look at the βX-Zyla-API-Calls-Monthly-Remainingβ header. For example, if your plan allows 1000 requests per month and you've used 100, this header will show 900.
To see the maximum number of API requests your plan allows, check the βX-Zyla-RateLimit-Limitβ header. For instance, if your plan includes 1000 requests per month, this header will display 1000.
The βX-Zyla-RateLimit-Resetβ header shows the number of seconds until your rate limit resets. This tells you when your request count will start fresh. For example, if it displays 3600, it means 3600 seconds are left until the limit resets.
Yes, you can cancel your plan anytime by going to your account and selecting the cancellation option on the Billing page. Please note that upgrades, downgrades, and cancellations take effect immediately. Additionally, upon cancellation, you will no longer have access to the service, even if you have remaining calls left in your quota.
You can contact us through our chat channel to receive immediate assistance. We are always online from 8 am to 5 pm (EST). If you reach us after that time, we will get back to you as soon as possible. Additionally, you can contact us via email at [email protected]
Service Level:
100%
Response Time:
6,225ms
Service Level:
100%
Response Time:
4,978ms
Service Level:
100%
Response Time:
1,500ms
Service Level:
100%
Response Time:
18,858ms
Service Level:
100%
Response Time:
285ms
Service Level:
100%
Response Time:
14,716ms
Service Level:
100%
Response Time:
1,419ms
Service Level:
100%
Response Time:
471ms
Service Level:
100%
Response Time:
3,497ms
Service Level:
100%
Response Time:
1,583ms
Service Level:
100%
Response Time:
2,560ms
Service Level:
100%
Response Time:
2,016ms
Service Level:
100%
Response Time:
1,277ms
Service Level:
100%
Response Time:
10,779ms
Service Level:
100%
Response Time:
146ms
Service Level:
100%
Response Time:
1,187ms
Service Level:
100%
Response Time:
811ms