HTML Extractor API

The HTML Extractor API extracts and returns the full HTML content of a web page, making it easy to analyze and extract data from websites.

About the API:  

The HTML Extractor API is an advanced tool designed to facilitate the extraction and analysis of data from web pages by retrieving the full HTML content of those pages. This API is useful for users, who need to access information contained in web sites for various purposes, such as market research, competition monitoring, or web application development.

Main Features:


Full HTML Code Retrieval: The main function of the HTML Extractor API is to capture the complete HTML code of a specific web page. This includes all the structural content of the page, such as tags, attributes and embedded elements. By obtaining the complete HTML, users can have access to all visible and hidden information on the page, allowing for a comprehensive analysis of the content.

Support for Different Types of Web Pages: The API is versatile and supports a wide range of Web sites, from static pages to dynamic sites that generate content using JavaScript. The ability to handle different types of content makes the API suitable for a variety of applications, such as news data collection, social network monitoring, and complex web page structure analysis.

Specific Data Extraction: Although the API provides the full HTML, it can also be used to extract specific page data. Users can combine the API with HTML parsing techniques, such as the use of regular expressions or HTML processing libraries, to extract particular information such as product prices, contact details or any other relevant data.

In summary, the HTML Extractor API is a powerful and flexible tool for extracting HTML content from web pages. It offers an effective solution for those who need full access to web page content for analysis, research or development.Its ability to handle a variety of page types and its easy integration make it a valuable option for numerous use cases in web data management and analysis.

 

What this API receives and what your API provides (input/output)?

The API receives a URL of a web page and provides the full HTML content of that page for analysis and data extraction.

 

What are the most common uses cases of this API?

  1. Competitor Research: Collect content from competitors' websites to analyze prices, products, promotions and marketing strategies.

    News Monitoring: Extract content from news sites to keep up with the latest events and updates in real time.

    Data Collection for Academic Research: Obtain and analyze content from multiple websites for academic research or case studies.

    Web Application Development: Use the API to extract and parse HTML from the web applications themselves during development and testing.

    SEO Analysis: Extract HTML from web pages to analyze important SEO elements such as meta tags, headings, and link structure.

     

Are there any limitations to your plans?

Beside the number of API calls per month allowed, there are no other limitations.

API Documentation

Endpoints


To use this endpoint, send an HTTP request with the URL of the desired page and receive the full HTML content of the page.



                                                                            
GET https://zylalabs.com/api/5079/html+extractor+api/6470/source+url
                                                                            
                                                                        

Source Url - Endpoint Features

Object Description
urlSupplier [Required] String
forceCache [Required] boolean
Test Endpoint

API EXAMPLE RESPONSE

       
                                                                                                        
                                                                                                                                                                                                                            {"method":"GET","urlSupplier":"https:\/\/www.reuters.com\/article\/us-usa-economy-idUSKBN2A40BO","redirectedUrlSupplier":"https:\/\/www.reuters.com\/article\/us-usa-economy-idUSKBN2A40BO\/","pageSource":"\u003Chtml lang=\u0022ja\u0022 data-layout=\u0022regular-article\u0022\u003E\u003Chead\u003E\u003Ctitle\u003E\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98 | \u30ed\u30a4\u30bf\u30fc\u003C\/title\u003E\u003Cmeta name=\u0022viewport\u0022 content=\u0022width=device-width, initial-scale=1\u0022\u003E\u003Cmeta name=\u0022apple-itunes-app\u0022 content=\u0022app-id=602660809, app-argument=https:\/\/www.reuters.com\/article\/opinion\/-idUSKBN2A40BN\/?id=KBN2A40BO\u0022\u003E\u003Cscript async=\u0022\u0022 src=\u0022https:\/\/js.datadome.co\/tags.js\u0022\u003E\u003C\/script\u003E\u003Cscript src=\u0022https:\/\/geolocation.onetrust.com\/cookieconsentpub\/v1\/geo\/location\/dnsfeed\u0022 async=\u0022\u0022 type=\u0022text\/javascript\u0022\u003E\u003C\/script\u003E\u003Cscript src=\u0022\/\/tru.am\/scripts\/ta-pagesocial-sdk.js\u0022\u003E\u003C\/script\u003E\u003Cscript type=\u0022text\/javascript\u0022 async=\u0022\u0022 src=\u0022\/\/img.en25.com\/i\/elqCfg.min.js\u0022\u003E\u003C\/script\u003E\u003Cscript type=\u0022text\/javascript\u0022 async=\u0022\u0022 src=\u0022https:\/\/cdn.segment.com\/analytics.js\/v1\/IEWBqQ8VWHijTQxb7lEBGFGS9uIJzigZ\/analytics.min.js\u0022\u003E\u003C\/script\u003E\u003Cscript async=\u0022\u0022 src=\u0022https:\/\/functionalfeather.com\/chunks\/4f0840710d72b\/1e91cf83e04ef163910e05eef.main.js\u0022\u003E\u003C\/script\u003E\u003Cscript async=\u0022\u0022 src=\u0022https:\/\/www.googletagmanager.com\/gtm.js?id=GTM-K5WTBZN\u0022\u003E\u003C\/script\u003E\u003Cscript\u003E(function(){\n      var current_location = window.location.href;\n\n      if (current_location.indexOf(\u0027\/info-pages\/supported-browsers\/\u0027) === -1) {\n        var supportFetchApi = \u0027fetch\u0027 in window;\n        var supportCSSGrid = window.CSS \u0026\u0026 CSS.supports(\u0027display\u0027, \u0027grid\u0027);\n\n        if (!supportFetchApi \u0026\u0026 !supportCSSGrid) {\n          window.location.href = \u0027\/info-pages\/supported-browsers\/\u0027;\n        }\n      }\n    })()\u003C\/script\u003E\u003Cscript src=\u0022\/pf\/resources\/dist\/reuters\/js\/index.js?d=216\u0022 async=\u0022\u0022 data-config=\u0022{\u0026quot;API_ORIGIN\u0026quot;:\u0026quot;https:\/\/api-reuters-reuters-prod.cdn.arcpublishing.com\u0026quot;,\u0026quot;ADMIN\u0026quot;:false,\u0026quot;BUSINESS_COUNTRIES\u0026quot;:[],\u0026quot;CHARTBEAT_CONFIG\u0026quot;:{\u0026quot;title\u0026quot;:\u0026quot;\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98\u0026quot;,\u0026quot;authors\u0026quot;:\u0026quot;Reuters\u0026quot;,\u0026quot;type\u0026quot;:\u0026quot;regular-article\u0026quot;,\u0026quot;domain\u0026quot;:\u0026quot;reuters.com\u0026quot;},\u0026quot;DEPLOYMENT\u0026quot;:\u0026quot;216\u0026quot;,\u0026quot;ELOQUA_SITE_ID\u0026quot;:\u0026quot;2124157686\u0026quot;,\u0026quot;MAX_NUMBER_OF_UNICODE_CHARS\u0026quot;:256,\u0026quot;ONETRUST_SCRIPT_ID\u0026quot;:\u0026quot;38cb75bd-fbe1-4ac8-b4af-e531ab368caf\u0026quot;,\u0026quot;PARSELY_SITE_ID\u0026quot;:\u0026quot;reuters.com\u0026quot;,\u0026quot;SEGMENT_WRITE_KEY\u0026quot;:\u0026quot;IEWBqQ8VWHijTQxb7lEBGFGS9uIJzigZ\u0026quot;,\u0026quot;SEGMENT_WRITE_KEY_MOBILE\u0026quot;:\u0026quot;YlmAIaFBxsNtlVJdfuSV0ncE931ghRtS\u0026quot;,\u0026quot;GRAPHICS_PLUGIN_IFRAME_URL\u0026quot;:\u0026quot;https:\/\/sphinx.thomsonreuters.com\/search\/?consumer=PageBuilder#\/search\/graphic\u0026quot;,\u0026quot;COMSCORE_CLIENT_ID\u0026quot;:\u0026quot;37296053\u0026quot;,\u0026quot;PAGE_CATEGORY\u0026quot;:\u0026quot;\u0026quot;,\u0026quot;PAGE_TYPE\u0026quot;:\u0026quot;\u0026quot;}\u0022\u003E\u003C\/script\u003E\u003Cscript src=\u0022https:\/\/www.reuters.com\/arc\/subs\/p.min.js\u0022 async=\u0022\u0022\u003E\u003C\/script\u003E\u003Cmeta property=\u0022fb:app_id\u0022 content=\u0022988502044532272\u0022\u003E\u003Cmeta property=\u0022fb:pages\u0022 content=\u0022114050161948682\u0022\u003E\u003Cmeta name=\u0022robots\u0022 content=\u0022noarchive, max-image-preview:large\u0022\u003E\u003Cmeta name=\u0022CCBot\u0022 content=\u0022nofollow\u0022\u003E\u003Clink href=\u0022https:\/\/www.googletagmanager.com\u0022 rel=\u0022preconnect\u0022\u003E\u003Clink href=\u0022https:\/\/connect.facebook.net\u0022 rel=\u0022preconnect\u0022\u003E\u003Cmeta property=\u0022og:title\u0022 content=\u0022\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98\u0022\u003E\u003Cmeta property=\u0022og:type\u0022 content=\u0022article\u0022\u003E\u003Cmeta property=\u0022og:image\u0022 content=\u0022https:\/\/www.reuters.com\/pf\/resources\/images\/reuters\/reuters-default.webp?d=216\u0022\u003E\u003Cmeta property=\u0022og:url\u0022 content=\u0022https:\/\/www.reuters.com\/article\/opinion\/-idUSKBN2A40BN\/\u0022\u003E\u003Cmeta property=\u0022og:description\u0022 content=\u0022\u52a0\u85e4\u52dd\u4fe1\u5b98\u623f\u9577\u5b98\u306f\uff14\u65e5\u5348\u524d\u306e\u4f1a\u898b\u3067\u3001\u6771\u4eac\u4e94\u8f2a\u30fb\u30d1\u30e9\u30ea\u30f3\u30d4\u30c3\u30af\u7d44\u7e54\u59d4\u54e1\u4f1a\u306e\u68ee\u559c\u6717\u4f1a\u9577\u306e\u767a\u8a00\u304c\u5973\u6027\u8511\u8996\u3068\u6279\u5224\u3055\u308c\u3001\u8f9e\u4efb\u3092\u8981\u6c42\u3059\u308b\u58f0\u3082\u3042\u308b\u3053\u3068\u306b\u3064\u3044\u3066\u3001\u653f\u5e9c\u3068\u3057\u3066\u30b3\u30e1\u30f3\u30c8\u3057\u306a\u3044\u3068\u3057\u305f\u4e0a\u3067\u3001\u307e\u305a\u306f\u4e94\u8f2a\u7d44\u7e54\u59d4\u54e1\u4f1a\u3067\u306e\u5bfe\u5fdc\u304c\u57fa\u672c\u3060\u3068\u8ff0\u3079\u305f\u3002\u0022\u003E\u003Cmeta property=\u0022og:locale\u0022 content=\u0022en_US\u0022\u003E\u003Cmeta property=\u0022og:site_name\u0022 content=\u0022Reuters\u0022\u003E\u003Cmeta name=\u0022article:published_time\u0022 content=\u00222021-02-04T03:22:13.000Z\u0022\u003E\u003Cmeta name=\u0022article:modified_time\u0022 content=\u00222021-02-04T03:17:14.000Z\u0022\u003E\u003Cmeta name=\u0022article:author\u0022 content=\u0022Reuters\u0022\u003E\u003Cmeta name=\u0022article:tag\u0022 content=\u0022ASIA,ASXPAC,COMDIS,COVID,EASIA,ECON,GEN,GENHLT,HEA,HUMDIS,INFDIS,JDOM,JLN,JP,MOF,OLY,PLCY,POL,PUBHEA,SPO,WOM\u0022\u003E\u003Cmeta property=\u0022og:image:url\u0022 content=\u0022https:\/\/www.reuters.com\/pf\/resources\/images\/reuters\/reuters-default.webp?d=216\u0022\u003E\u003Cmeta property=\u0022og:image:width\u0022 content=\u00221200\u0022\u003E\u003Cmeta property=\u0022og:image:height\u0022 content=\u0022628\u0022\u003E\u003Cmeta property=\u0022og:image:alt\u0022 content=\u0022Reuters logo\u0022\u003E\u003Cmeta name=\u0022twitter:card\u0022 content=\u0022summary_large_image\u0022\u003E\u003Cmeta name=\u0022twitter:site\u0022 content=\u0022@Reuters\u0022\u003E\u003Cmeta name=\u0022twitter:creator\u0022 content=\u0022@Reuters\u0022\u003E\u003Cmeta name=\u0022twitter:description\u0022 content=\u0022\u52a0\u85e4\u52dd\u4fe1\u5b98\u623f\u9577\u5b98\u306f\uff14\u65e5\u5348\u524d\u306e\u4f1a\u898b\u3067\u3001\u6771\u4eac\u4e94\u8f2a\u30fb\u30d1\u30e9\u30ea\u30f3\u30d4\u30c3\u30af\u7d44\u7e54\u59d4\u54e1\u4f1a\u306e\u68ee\u559c\u6717\u4f1a\u9577\u306e\u767a\u8a00\u304c\u5973\u6027\u8511\u8996\u3068\u6279\u5224\u3055\u308c\u3001\u8f9e\u4efb\u3092\u8981\u6c42\u3059\u308b\u58f0\u3082\u3042\u308b\u3053\u3068\u306b\u3064\u3044\u3066\u3001\u653f\u5e9c\u3068\u3057\u3066\u30b3\u30e1\u30f3\u30c8\u3057\u306a\u3044\u3068\u3057\u305f\u4e0a\u3067\u3001\u307e\u305a\u306f\u4e94\u8f2a\u7d44\u7e54\u59d4\u54e1\u4f1a\u3067\u306e\u5bfe\u5fdc\u304c\u57fa\u672c\u3060\u3068\u8ff0\u3079\u305f\u3002\u0022\u003E\u003Cmeta name=\u0022twitter:title\u0022 content=\u0022\u68ee\u4f1a\u9577\u306e\u767a\u8a00\u554f\u984c\u3001\u307e\u305a\u7d44\u7e54\u59d4\u306e\u5bfe\u5fdc\u307f\u3066\u8003\u3048\u308b\uff1d\u52a0\u85e4\u5b98\u623f\u9577\u5b98\u0022\u003E\u003Cmeta name=\u0022twitter:image\u0022 content=\u0022https:\/\/www.reuters.com\/pf\/resources\/images\/reuters\/reuters-default.webp?d=216\u0022\u003E\u003Cmeta name=\u0022twitter:image:alt\u0022 content=\u0022Reuters logo\u0022\u003E\u003Cmeta name=\u0022keywords\u0022 content=\u0022ASIA,ASXPAC,COMDIS,COVID,EASIA,ECON,GEN,GENHLT,HEA,HUMDIS,INFDIS,JDOM,JLN,JP,MOF,OLY,PLCY,POL,PUBHEA,SPO,WOM\u0022\u003E\u003Cmeta name=\u0022description\u0022 content=\u0022\u52a0\u85e4\u52dd\u4fe1\u5b98\u623f\u9577\u5b98\u306f\uff14\u65e5\u5348\u524d\u306e\u4f1a\u898b\u3067\u3001\u6771\u4eac\u4e94\u8f2a\u30fb\u30d1\u30e9\u30ea\u30f3\u30d4\u30c3\u30af\u7d44\u7e54\u59d4\u54e1\u4f1a\u306e\u68ee\u559c\u6717\u4f1a\u9577\u306e\u767a\u8a00\u304c\u5973\u6027\u8511\u8996\u3068\u6279\u5224\u3055\u308c\u3001\u8f9e\u4efb\u3092\u8981\u6c42\u3059\u308b\u58f0\u3082\u3042\u308b\u3053\u3068\u306b\u3064\u3044\u3066\u3001\u653f\u5e9c\u3068\u3057\u3066\u30b3\u30e1\u30f3\u30c8\u3057\u306a\u3044\u3068\u3057\u305f\u4e0a\u3067\u3001\u307e\u305a\u306f\u4e94\u8f2a\u7d44\u7e54\u59d4\u54e1\u4f1a\u3067\u306e\u5bfe\u5fdc\u304c\u57fa\u672c\u3060\u3068\u8ff0\u3079\u305f\u3002\u0022\u003E\u003Cmeta property=\u0022fb:admins\u0022 content=\u0022988502044532272\u0022\u003E\u003Cmeta property=\u0022og:locale:alternate\u0022 content=\u0022en_US\u0022\u003E\u003Cmeta name=\u0022DCSext.DartZone\u0022 content=\u0022\/4735792\/reuters.com\/opinion\/article\u0022\u003E\u003Cmeta property=\u0022og:article:modified_time\u0022 content=\u00222021-02-04T03:17:14.000Z\u0022\u003E\u003Cmeta property=\u0022og:updated_time\u0022 content=\u00222021-02-04T03:17:14.000Z\u0022\u003E\u003Cmeta property=\u0022og:article:published_time\u0022 content=\u00222021-0...
                                                                                                                                                                                                                    
                                                                                                    

Source Url - CODE SNIPPETS


curl --location --request GET 'https://zylalabs.com/api/5079/html+extractor+api/6470/source+url?urlSupplier=https://www.reuters.com/article/us-usa-economy-idUSKBN2A40BO&forceCache=True' --header 'Authorization: Bearer YOUR_API_KEY' 


    

API Access Key & Authentication

After signing up, every developer is assigned a personal API access key, a unique combination of letters and digits provided to access to our API endpoint. To authenticate with the HTML Extractor API REST API, simply include your bearer token in the Authorization header.
Headers
Header Description
Authorization [Required] Should be Bearer access_key. See "Your API Access Key" above when you are subscribed.

Simple Transparent Pricing

No long term commitments. One click upgrade/downgrade or cancellation. No questions asked.

πŸš€ Enterprise

Starts at
$ 10,000/Year


  • Custom Volume
  • Specialized Customer Support
  • Real-Time API Monitoring

Customer favorite features

  • βœ”οΈŽ Only Pay for Successful Requests
  • βœ”οΈŽ Free 7-Day Trial
  • βœ”οΈŽ Multi-Language Support
  • βœ”οΈŽ One API Key, All APIs.
  • βœ”οΈŽ Intuitive Dashboard
  • βœ”οΈŽ Comprehensive Error Handling
  • βœ”οΈŽ Developer-Friendly Docs
  • βœ”οΈŽ Postman Integration
  • βœ”οΈŽ Secure HTTPS Connections
  • βœ”οΈŽ Reliable Uptime

To use this API, you send a request with the URL of the web page and receive the full HTML content for parsing and extraction.

The HTML Extractor API fetches the complete HTML code from a web page, making it easy to parse and extract data from the content.

There are different plans suits everyone including a free trial for small amount of requests, but it’s rate is limit to prevent abuse of the service.

Zyla provides a wide range of integration methods for almost all programming languages. You can use these codes to integrate with your project as you need.

The API returns detailed information about the age and history of a domain, including years, months and days since its creation, as well as expiration and update dates.

Zyla API Hub is like a big store for APIs, where you can find thousands of them all in one place. We also offer dedicated support and real-time monitoring of all APIs. Once you sign up, you can pick and choose which APIs you want to use. Just remember, each API needs its own subscription. But if you subscribe to multiple ones, you'll use the same key for all of them, making things easier for you.

Prices are listed in USD (United States Dollar), EUR (Euro), CAD (Canadian Dollar), AUD (Australian Dollar), and GBP (British Pound). We accept all major debit and credit cards. Our payment system uses the latest security technology and is powered by Stripe, one of the world’s most reliable payment companies. If you have any trouble paying by card, just contact us at [email protected]

Additionally, if you already have an active subscription in any of these currencies (USD, EUR, CAD, AUD, GBP), that currency will remain for subsequent subscriptions. You can change the currency at any time as long as you don't have any active subscriptions.

The local currency shown on the pricing page is based on the country of your IP address and is provided for reference only. The actual prices are in USD (United States Dollar). When you make a payment, the charge will appear on your card statement in USD, even if you see the equivalent amount in your local currency on our website. This means you cannot pay directly with your local currency.

Occasionally, a bank may decline the charge due to its fraud protection settings. We suggest reaching out to your bank initially to check if they are blocking our charges. Also, you can access the Billing Portal and change the card associated to make the payment. If these does not work and you need further assistance, please contact our team at [email protected]

Prices are determined by a recurring monthly or yearly subscription, depending on the chosen plan.

API calls are deducted from your plan based on successful requests. Each plan comes with a specific number of calls that you can make per month. Only successful calls, indicated by a Status 200 response, will be counted against your total. This ensures that failed or incomplete requests do not impact your monthly quota.

Zyla API Hub works on a recurring monthly subscription system. Your billing cycle will start the day you purchase one of the paid plans, and it will renew the same day of the next month. So be aware to cancel your subscription beforehand if you want to avoid future charges.

To upgrade your current subscription plan, simply go to the pricing page of the API and select the plan you want to upgrade to. The upgrade will be instant, allowing you to immediately enjoy the features of the new plan. Please note that any remaining calls from your previous plan will not be carried over to the new plan, so be aware of this when upgrading. You will be charged the full amount of the new plan.

To check how many API calls you have left for the current month, refer to the β€˜X-Zyla-API-Calls-Monthly-Remaining’ field in the response header. For example, if your plan allows 1000 requests per month and you've used 100, this field in the response header will indicate 900 remaining calls.

To see the maximum number of API requests your plan allows, check the β€˜X-Zyla-RateLimit-Limit’ response header. For instance, if your plan includes 1000 requests per month, this header will display 1000.

The β€˜X-Zyla-RateLimit-Reset’ header shows the number of seconds until your rate limit resets. This tells you when your request count will start fresh. For example, if it displays 3600, it means 3600 seconds are left until the limit resets.

Yes, you can cancel your plan anytime by going to your account and selecting the cancellation option on the Billing page. Please note that upgrades, downgrades, and cancellations take effect immediately. Additionally, upon cancellation, you will no longer have access to the service, even if you have remaining calls left in your quota.

You can contact us through our chat channel to receive immediate assistance. We are always online from 8 am to 5 pm (EST). If you reach us after that time, we will get back to you as soon as possible. Additionally, you can contact us via email at [email protected]

To let you experience our APIs without any commitment, we offer a 7-day free trial that allows you to make API calls at no cost during this period. Please note that you can only use this trial once, so make sure to use it with the API that interests you the most. Most of our APIs provide a free trial, but some may not support it.

After 7 days, you will be charged the full amount for the plan you were subscribed to during the trial. Therefore, it’s important to cancel before the trial period ends. Refund requests for forgetting to cancel on time are not accepted.

When you subscribe to an API trial, you can make only 25% of the calls allowed by that plan. For example, if the API plan offers 1000 calls, you can make only 250 during the trial. To access the full number of calls offered by the plan, you will need to subscribe to the full plan.

 Service Level
100%
 Response Time
6,185ms

Category:


Related APIs