![]() |
Feel free to use this board to discuss anything related to searching matter.
|
Google does pdf (07/02/01 20.48.58)
http://www.searchenginewatch.com/sereport/01/02-google.html Google now includes listings of Adobe PDF files from across the web, a first for any major search engine Not all of Google's computers have been updated with the PDF information, which means that is pretty random as to whether you'll encounter the PDF-enhanced listings.[...]All the computers should be updated by the end of this week, he said. When that happens, users will have access to the full text of 13 million PDF files. They will appear mixed among the normal listings of HTML documents, when relevant for a particular query. However, PDF files will be prefaced by a [pdf] label next to their title. Want to restrict your search to just PDF files at Google? Make use of the inurl: command AFTER your search words. For instance, a search for colleges inurl:pdf tells Google to find documents containing the word colleges and which have pdf in their URL. So far, this seems to pretty much ensure that you only get back PDF files. Multiword queries also seem to work -- amazon ebay inurl:pdf brings back PDF files mentioning both of those companies, for example. [more google special commands:] allinurl: is supposed to tell Google to find ALL the words you specify after it, within the URL of a web page. In contrast, inurl: is supposed to find ANY of the words you specify after it in the URL, rather than all of them, Google says. You can also try allintitle: to find all the words you specify within the title of a document, while intitle: is supposed to find ANY of the words. There's also a site: command, which is useful for finding pages just from a particular site -- site:searchenginewatch.com google would bring up all the pages from Search Engine Watch that mention Google, for instance. To say we support the OR command would be a great overstatement. People have discovered if you type in a capital OR, Google does something with it," Silverstein said. What's happened is that Google has added some behind the scene logic to catch variations of words written with diacritical marks, such as accented words like pêches (French for peaches), even when they are written without the accents, Silverstein said. This logic means that if you do a search for two words, with a capital OR in the middle, you may get something similar to a Boolean OR working for you. [There's a couple of interesting links at the end of the article - have a look for yourself] |
| back to main board | expand thread |
|
This board is still under construction and so you shouldn't use it to discuss important stuff as it may
(and probably will :-) happen that everything get lost.
Anyway, here are some guidelines on how to use it :
When submitting a new post, check is done against offending tags. All tags are stripped from the name and email fields while
some tags are allowed for the subject and the content. You can always preview a post before actually submitting it by check 'preview before submit'
and you can be warned of any removed/replaced tags by checking 'Ask for confirmation if modified'.
Well, enough babbling, try it by yourself and report any bugs, comments, ideas, critics either on the board itself of by email to phplab@2113.ch |