Web Automation & Web Data Extraction
This allows to use the Web both as a data source (e.g. extracting product pricing information from competitor websites) and as a way to interact with external data sources (e.g. to automate operations with a B2B partner) in your company's Data Virtualization and Data Federation scenarios.
Web Automation
The Denodo Platform provides, among others, the following Web Automation features:
- Record navigation sequences of any complexity by simply navigating in a web browser. Sources using AJAX and Javascript technologies are fully supported, regardless of their level of complexity. Pop-ups, authentication, or browser dialogs are also transparently handled.
- Graphically design Web Automation processes of any complexity by combining navigation sequences and data extraction actions using loops, conditions, filters and transformation functions. Define templates for common workflow schemes so less advanced users do not need to create the workflows themselves, but can create wrappers by simply specifying navigation sequences and data examples.
- Dual design process that enables advanced users to be highly efficient by describing the Web Automation process either graphically or with the Denodo state-of-the-art browsing and extraction languages.
- Maximum performance optimization: server-side browser pool to reuse browsers and website sessions, allowing hundreds of concurrent outgoing wrappers running on a single server. Three different navigation sequence environments are available: Microsoft Internet Explorer and Firefox ensure the ability to replicate any navigation sequence in any website while the Denodo Browser maximizes performance.
Web Data Extraction
The Denodo Platform provides, among others, the following Web Data Extraction features:
- Specify the web data you wish to extract from each page by simply marking examples of the desired data in the browser. Extract the information in fully structured form, so you can process the data with the same power as structured data in a relational database.
- Leverage Denodo's advanced Web Data Extraction technology to extract structured data from PDF, Word and Excel documents
- Automatically detect source changes affecting the effectiveness of the wrapper and, in certain cases, even automatically correct the wrapper to adjust to the new situation.
- Publish the created wrappers in the Denodo Data Services Platform, so they can be combined at will with any other data source; and publish the wrappers directly as a Data Service, so your applications can gain programmatic access to the data and services provided by any website, the Cloud and SaaS application.

