Anyone could help?
I was trying to delete the web addresses from this page:
https://www.paginasamarillas.com/ (I've moved the .com from the site name)
What busca? (What are you looking for?): Zapatos
In the world? (where?): Colombia
The results are 6405 companies in 321 pages. If I can get the websites from these 321 pages in a single code, that would be great. If not, I can copy the 321 web addresses and forward them to the scraper to get the websites from these pages.
I can not extract web addresses from the results using the custom crawler of the premium e-mail plug-in.
If I hit Control U, it will give me the HTML code of the page.
Then I can look for an example of a website where I want to find the HTML markers around this site.
Allows you to choose: masdotaciones .com as an example. All other sites should have the same markers.
Someone might tell me what I'm doing wrong, why do not I get some scrapers?
The html codes around the website example that I found are 3 of them:
href = "[/url]
http: //www.masdotaciones .com
2 and 3.
onclick = "return clickLinkIntegration (this, TipoClick.CLIC_WEB," 80029059, "1277273", LOCALITY_ID, LOCALITY_TERM, CATEGORY_ID, Pagina.PAGE_RESULTADOS, & # 39;); & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp; & nbsp;); "
href = "h[url=http://www.masdotaciones.com/]ttp: //www.masdotaciones .com "class =" linkLink "
title = "Visit the website of Mas Dotaciones Y CIA Ltda." target = "_ blank" rel = "nofollow"
itemprop = "url"> http: //www.masdotaciones .com