Data tables can be extracted very easily from a web page, using the "Extract Data from Web Page" Action.
In order to extract a table, we should first extract a couple of items of the 1st column. Then, by extracting another item from another column, the whole table is going to be extracted (2- column table).
Should we want a 3rd column, then we can just extract another item of the 3rd column (3- column table).
All the rest of the configuration is being handles automatically by the Action.
However, it is important to know how the Action works, so that we can modify it accordingly if needed.
How it works
Should we choose to extract a 3 - column table, then the Advanced Settings of the Web Helper will look like the below
The Base CSS Selector is the common preceding part of all the additional CSS Selectors that are stated above.
For instance, if the Base CSS Selector is this:
html > body > div:eq(3) > div > div > section:eq(1) > div:eq(1) > div > div > div > div > div > table > tbody > tr
this means that when combined with the additional CSS selectors, it will give us the CSS selectors of the actual table columns.
Important note: The last element of the Base CSS Selector (in this case the element "tr") is actually repeated in sequence many times within the HTML code.
Hence, tr:eq(0) would correspond to the 1st item, tr:eq(1) to the 2nd one and so on - so that is what creates the extracted lists.
In this case that we have a table, we have more than one lists, which would actually be the table's columns.