|
Google search issues |
FMC
Member #4,431
March 2004
|
I'm having a hard time getting Google to understand my query exactly as intended. I'm doing a specific search on all sites about a certain something which link to rapidshare (et similia), which I translated as (in this example I'm searching for mountain photos): If you try this query you'll see the first results are all noise from either filehunt or rapidlibrary, so I filtered the output:
+diablo +3 "megaupload.com” OR "filecoast.com/?file=" OR “rapidshare.com” OR “bigupload.com/files/” OR “filecoast.com” OR “uploadmb.com/dw.php” OR “axifile.com/fi.php?” OR “badongo.com/file/” OR "bigfilez.com" OR “filehost.ro” -inurl:rapidlibrary -inurl:fileshunt
(4th result is rapidlibrary) Problem is, sometimes the filter doesn't work and it still shows results with rapidlibrary (or similar) in the URL. How would you explain this? [FMC Studios] - [Caries Field] - [Ctris] - [Pman] - [Chess for allegroites] |
bamccaig
Member #7,536
July 2006
|
-- acc.js | al4anim - Allegro 4 Animation library | Allegro 5 VS/NuGet Guide | Allegro.cc Mockup | Allegro.cc <code> Tag | Allegro 4 Timer Example (w/ Semaphores) | Allegro 5 "Winpkg" (MSVC readme) | Bambot | Blog | C++ STL Container Flowchart | Castopulence Software | Check Return Values | Derail? | Is This A Discussion? Flow Chart | Filesystem Hierarchy Standard | Clean Code Talks - Global State and Singletons | How To Use Header Files | GNU/Linux (Debian, Fedora, Gentoo) | rot (rot13, rot47, rotN) | Streaming |
FMC
Member #4,431
March 2004
|
What? |
bamccaig
Member #7,536
July 2006
|
Search for "Domain search" on Google's Web Search Help Center (the page I linked has it so you can just search for the text on that page). Google allows you to specify matching domains with the special site: option. You basically specify a matching domain name by saying site:domain. You can also exclude domains, AFAIK, by putting a minus sign (-) in front of the site:domain term, as in -site:domain; though if you're using site:domain to limit it to specific domains you wouldn't need to exclude any. AFAIK, you should be able to use OR to specify more than one matching domain. |
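To make the site: syntax described above concrete, here is a minimal sketch in Python that only assembles the query text. The helper name `site_filter` and its parameters are made up for illustration, and nothing here checks how Google actually interprets the result.

```python
def site_filter(include=(), exclude=()):
    """Build the site: portion of a search query (hypothetical helper).

    Domains in `include` are joined with OR; domains in `exclude`
    get a leading minus sign, as in -site:domain.
    """
    parts = []
    if include:
        parts.append(" OR ".join(f"site:{domain}" for domain in include))
    parts.extend(f"-site:{domain}" for domain in exclude)
    return " ".join(parts)


# Limit results to two domains.
print(site_filter(include=["rapidshare.com", "megaupload.com"]))
# Exclude one domain from an otherwise unrestricted search.
print(site_filter(exclude=["rapidlibrary.com"]))
```

As the post says, combining both forms in one query is usually pointless: once the search is limited to specific domains there is nothing left to exclude.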
CGamesPlay
Member #2,559
July 2002
|
What if you use parentheses to specify (A OR B OR C) -D -E? -- Ryan Patterson - <http://cgamesplay.com/> |
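The (A OR B OR C) -D -E shape suggested above could be assembled like this. It is only a sketch of the string layout: `grouped_query` is a hypothetical helper, and whether Google treats the parentheses as real grouping is not something this thread confirms.

```python
def grouped_query(phrase, alternatives, excluded_url_words):
    """Return '"phrase" ("A" OR "B" OR ...) -inurl:D -inurl:E' (hypothetical helper)."""
    # Quote each alternative with plain ASCII double quotes.
    group = " OR ".join(f'"{alt}"' for alt in alternatives)
    # One -inurl: term per word that should not appear in result URLs.
    exclusions = " ".join(f"-inurl:{word}" for word in excluded_url_words)
    return f'"{phrase}" ({group}) {exclusions}'.strip()


print(grouped_query(
    "diablo 3",
    ["rapidshare.com", "megaupload.com"],
    ["rapidlibrary", "fileshunt"],
))
```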
FMC
Member #4,431
March 2004
|
@bamccaig: i knew about site but there are still cases in which it doesn't work. @CGamesPlay: it doesn't seem to matter if i use parentheses. It's really weird, in most cases it works as expected, but in others it doesn't. |
CGamesPlay
Member #2,559
July 2002
|
Well, the way Google's architecture works means that your query goes to hundreds of separate computers, and some of them will have copies of the indexed sites you're looking for, and some of them won't. The most commonly used results are cached all over the place, but doing specific searches will return varied results. This isn't a bug, just a product of the design. |
bamccaig
Member #7,536
July 2006
|
FMC said:
@bamccaig: i knew about site but there are still cases in which it doesn't work.
Search query said: "diablo 3" "megaupload.com” OR "filecoast.com/?file=" \ OR “rapidshare.com” OR “bigupload.com/files/” OR “filecoast.com” \ OR “uploadmb.com/dw.php” OR “axifile.com/fi.php?” OR “badongo.com/file/” \ OR "bigfilez.com" OR “filehost.ro” -site:rapidlibrary.com -inurl:fileshunt
5th result said:
Diablo 3 trailer - Megaupload Search
I think I fail to see what didn't work about it... Maybe the results have changed? |
CGamesPlay
Member #2,559
July 2002
|
It might help if you used real quotes, rather than those stylized things. |
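For anyone pasting a query out of a web page, the stylized quotes can be swapped for plain ones before searching. A minimal sketch, assuming the only offenders are the four common curly quote characters; `straighten_quotes` is a made-up name.

```python
# Map curly (typographic) quotes to their plain ASCII equivalents.
CURLY_TO_STRAIGHT = str.maketrans({
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
})


def straighten_quotes(query):
    """Return the query with curly quotes replaced by straight ones."""
    return query.translate(CURLY_TO_STRAIGHT)


print(straighten_quotes("\u201crapidshare.com\u201d OR \"bigfilez.com\""))
```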
bamccaig
Member #7,536
July 2006
|
CGamesPlay said: It might help if you used real quotes, rather than those stylized things. In case you were talking to me, I copied it from the linked page... I was thinking the same thing though. |
FMC
Member #4,431
March 2004
|
@bamccaig: this is really weird, 10 minutes ago the result was the one you saw, now it's this (red highlight by me):
Quote: Well, the way Google's architecture works means that your query goes to hundreds of separate computers, and some of them will have copies of the indexed sites you're looking for, and some of them won't. The most commonly used results are cached all over the place, but doing specific searches will return varied results. This isn't a bug, just a product of the design.
The bug I was pointing at is that I specified to NOT get results with a certain word in the URL, and I still get them.
Quote: It might help if you used real quotes, rather than those stylized things.
"diablo 3" site:megaupload.com OR inurl:"filecoast.com/?file=" OR site:rapidshare.com OR inurl:"bigupload.com/files/" OR site:filecoast.com OR inurl:"uploadmb.com/dw.php" OR inurl:"axifile.com/fi.php?" OR inurl:"badongo.com/file/" OR "bigfilez.com" OR site:filehost.ro
Except your query only returns results FROM one of those sites, which never was my goal.
I said: I'm doing a specific search on all sites about a certain something which link to rapidshare (et similia)
I'm interested in the sites which link to the file hosting portals, not the portals themselves. And I do know about the link: tool, but I prefer not to use it, so that I can catch malformed URLs and broken links. |
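What the -inurl: terms are expected to do can be written as a local check over a list of result URLs. Nobody in the thread proposed this; it is just a sketch of the expected behaviour, with a hypothetical helper name and made-up example URLs.

```python
def drop_excluded(urls, excluded_words):
    """Keep only the URLs that contain none of the excluded words.

    Matching is a case-insensitive substring test, which is an
    assumption about what -inurl:word is meant to exclude.
    """
    lowered = [word.lower() for word in excluded_words]
    return [
        url for url in urls
        if not any(word in url.lower() for word in lowered)
    ]


# Made-up result URLs, for illustration only.
results = [
    "http://example.com/forum/diablo-3-trailer",
    "http://rapidlibrary.example/index.php?q=diablo+3",
]
print(drop_excluded(results, ["rapidlibrary", "fileshunt"]))
```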
CGamesPlay
Member #2,559
July 2002
|
Quote: Except your query only returns results FROM one of those sites, which never was my goal. Does it? I just did it and the first page was 4 distinct sites. Still, it isn't what you want, you're right. |
bamccaig
Member #7,536
July 2006
|
FMC said: @bamccaig: this is really weird, 10 minutes ago the result was the one you saw, now it's this (red highlight by me)... That looks like the 4th result to me. At least now I see what you're saying. Google, WTF!? (I don't mean it, Google. I still love you... We're probably just missing something) |
FMC
Member #4,431
March 2004
|
@bamccaig: it was fifth this afternoon, it apparently changed a couple of times. Unfortunately it may just be an internal inconsistency, and it isn't the only one: http://www.searchengineshowdown.com/features/google/inconsistent.shtml |
BAF
Member #2,981
December 2002
|
Well, that page you linked is from 2003, and I believe only some of those quirks still exist these days. |
FMC
Member #4,431
March 2004
|
Quote: Well, that page you linked is from 2003, and I believe only some of those quirks still exist these days. Actually it seems most of them are still there. |
Jeff Bernard
Member #6,698
December 2005
|
If it's just a matter of searching rapidshare and megaupload, there are websites that'll search for links for you. Just google rapidsearch or 4megaupload. |
FMC
Member #4,431
March 2004
|
Quote: If it's just a matter of searching rapidshare and megaupload, there are websites that'll search for links for you. Just google rapidsearch or 4megaupload. Thanks, but i have more fun fiddling with google directly. |
|