Author Topic: Request For Browser Extension: Make Firefox Look Like The Google Spider  (Read 1502 times)

0 Members and 2 Guests are viewing this topic.

Offline JoeNTopic starter

  • Frequent Contributor
  • **
  • Posts: 991
  • Country: us
  • We Buy Trannies By The Truckload
Ever have this happen?  You search for something and Google gives you a nice synopsis of exactly what you are looking for and a link and you follow the link and immediately hit a paywall?  I was looking for "28nm gate capacitance" out of curiosity.  Google gives this synopsis:

[PDF] 24.7 A High-Performance, High-Density 28nm eDRAM ... - IEEE Xplore
ieeexplore.ieee.org/iel5/6123666/6131464/06131608.pdf
Metal-Insulator-Metal (MIM) capacitor and integrated into. 28nm High-K Metal Gate (HKMG) logic technology. This. eDRAM memory features an HKMG CMOS  ...

When you follow the link:

http://ieeexplore.ieee.org/iel5/6123666/6131464/06131608.pdf

An immediate paywall.  And the words above are not even hidden in the HTML.  How did Google read the PDF in order to index it properly?

I don't know, but I bet the web server is giving up the content if it thinks you are Google.  But unless it is going off of known IPs, the only way to know what is at the end of an HTTP connection is by the headers, which can be anything you want them to be.

Hmmm.....     >:D
Have You Been Triggered Today?
 

Offline Jeroen3

  • Super Contributor
  • ***
  • Posts: 4098
  • Country: nl
  • Embedded Engineer
    • jeroen3.nl
Re: Request For Browser Extension: Make Firefox Look Like The Google Spider
« Reply #1 on: January 25, 2017, 10:00:58 am »
One can submit the pdf to google to index without the crawler finding it himself.
 

Offline NottheDan

  • Frequent Contributor
  • **
  • Posts: 281
  • Country: gb
Re: Request For Browser Extension: Make Firefox Look Like The Google Spider
« Reply #2 on: January 25, 2017, 10:28:04 am »
Look for a User Agent switcher addon for your FF and use it to switch your browser's identification to on of the robots when needed.

https://addons.mozilla.org/en-GB/firefox/addon/user-agent-switcher/
 

Online amyk

  • Super Contributor
  • ***
  • Posts: 8341
Re: Request For Browser Extension: Make Firefox Look Like The Google Spider
« Reply #3 on: January 25, 2017, 12:20:26 pm »
Most of these use IP; I believe there was a way using one of Google's services as a proxy, but I don't know if that route has been blocked now. Even Google Translate used to work. Do some searching for more information on that... of course, for articles like that one, check SciHub/Libgen.
 

Offline tom66

  • Super Contributor
  • ***
  • Posts: 6836
  • Country: gb
  • Electronics Hobbyist & FPGA/Embedded Systems EE
Re: Request For Browser Extension: Make Firefox Look Like The Google Spider
« Reply #4 on: January 25, 2017, 12:25:16 pm »
Google penalises websites that present different content to their spiders...if you find a paywalled article try copying the title into Google and you'll often get access to it. If you don't, then Google should be seeing the same summary you get.
 


Share me

Digg  Facebook  SlashDot  Delicious  Technorati  Twitter  Google  Yahoo
Smf