Google search issues
FMC

I'm having a hard time getting Google to understand my query exactly as intended.

I'm doing a specific search on all sites about a certain something which link to rapidshare (et similia), which I translated as follows (in this example I'm searching for mountain photos):
[url http://www.google.com/search?q=mountain+photos+rapidshare+OR+megaupload]
mountain photos rapidshare OR megaupload[/url]

If you try this query you'll see the first results are all noise from either fileshunt or rapidlibrary, so I filtered the output:
[url http://www.google.com/search?q=mountain+photos+rapidshare+OR+megaupload+-inurl%3Afileshunt+-inurl%3Arapidlibrary]
mountain photos rapidshare OR megaupload -inurl:fileshunt -inurl:rapidlibrary[/url]
Which, in this case, works.
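
For reference, the filtered query above can also be put together programmatically. This is only a minimal sketch in Python using the standard `urllib.parse` module; the helper name `build_search_url` and its parameters are made up for illustration, and it relies on nothing beyond the `q` parameter visible in the links above.

```python
from urllib.parse import urlencode

def build_search_url(terms, excluded_url_words=()):
    """Build a Google search URL from plain terms plus -inurl: exclusions.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    parts = list(terms)
    # Each excluded word becomes a "-inurl:word" term, as in the query above.
    parts += [f"-inurl:{word}" for word in excluded_url_words]
    query = " ".join(parts)
    # urlencode turns spaces into "+" and ":" into "%3A".
    return "http://www.google.com/search?" + urlencode({"q": query})

url = build_search_url(
    ["mountain", "photos", "rapidshare", "OR", "megaupload"],
    excluded_url_words=["fileshunt", "rapidlibrary"],
)
print(url)
```

The printed URL should be the same as the filtered link above, which makes it easy to add or remove exclusions without hand-editing the encoded string.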

The problem is that sometimes it doesn't, and it still shows results with rapidlibrary (or similar) in the URL. ???
Ex:[url http://www.google.com/search?q=%2Bdiablo+%2B3+%22megaupload.com%94+OR++%22filecoast.com%2F%3Ffile%3D%22+OR++%93rapidshare.com%94+OR++%93bigupload.com%2Ffiles%2F%94+OR+++%93filecoast.com%94+OR++%93uploadmb.com%2Fdw.php%94+OR++%93axifile.com%2Ffi.php%3F%94+OR++%93badongo.com%2Ffile%2F%94+OR++%22bigfilez.com%22+OR++%93filehost.ro%94+-inurl%3Arapidlibrary+-inurl%3Afileshunt]

+diablo +3 "megaupload.com” OR  "filecoast.com/?file=" OR  “rapidshare.com” OR  “bigupload.com/files/” OR   “filecoast.com” 
OR  “uploadmb.com/dw.php” OR  “axifile.com/fi.php?” OR  “badongo.com/file/” OR  "bigfilez.com" OR  “filehost.ro” 
-inurl:rapidlibrary -inurl:fileshunt

[/url](4th result is rapidlibrary)

How would you explain this?

bamccaig

site:subdomain1.domain1.tld1 OR site:subdomain2.domain2.tld2

???

FMC

What?

bamccaig

Search for "Domain search" on Google's Web Search Help Center (the page I linked has it so you can just search for the text on that page). Google allows you to specify matching domains with the special site: option. You basically specify a matching domain name by saying site:domain. You can also exclude domains, AFAIK, by putting a minus sign (-) in front of the site:domain term, as in -site:domain; though if you're using site:domain to limit it to specific domains you wouldn't need to exclude any. :P AFAIK, you should be able to use OR to specify more than one matching domain.
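
As a rough illustration of the site: syntax described above, here is a small Python sketch. The function name `site_terms` is hypothetical, and the output is only the query text to paste into the search box; how Google actually combines the terms is not something this sketch can guarantee.

```python
def site_terms(include=(), exclude=()):
    """Return a query fragment such as 'site:a.com OR site:b.com -site:c.com'.

    Hypothetical helper, for illustration only.
    """
    # Domains to match are joined with OR.
    wanted = " OR ".join(f"site:{domain}" for domain in include)
    # Domains to leave out each get a leading minus sign.
    unwanted = " ".join(f"-site:{domain}" for domain in exclude)
    return " ".join(part for part in (wanted, unwanted) if part)

print(site_terms(include=["rapidshare.com", "megaupload.com"]))
print(site_terms(exclude=["rapidlibrary.com"]))
```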

CGamesPlay

What if you use parentheses to specify (A OR B OR C) -D -E?

FMC

@bamccaig: I knew about site: but there are still cases in which it doesn't work.
Try <a href="http://www.google.com/search?q="diablo+3"+"megaupload.com”+OR++"filecoast.com%2F%3Ffile%3D"+OR++“rapidshare.com”+OR++“bigupload.com%2Ffiles%2F”+OR+++“filecoast.com”+OR++“uploadmb.com%2Fdw.php”+OR++“axifile.com%2Ffi.php%3F”+OR++“badongo.com%2Ffile%2F”+OR++"bigfilez.com"+OR++“filehost.ro”+-site%3Arapidlibrary.com+-inurl%3Afileshunt">this.</a> (fifth result)

@CGamesPlay: it doesn't seem to matter if I use parentheses.

It's really weird, in most cases it works as expected, but in others it doesn't.
I'd be quite tempted to say "bug" if we weren't talking of google...

CGamesPlay

Well, the way Google's architecture works means that your query goes to hundreds of separate computers, and some of them will have copies of the indexed sites you're looking for, and some of them won't. The most commonly used results are cached all over the place, but doing specific searches will return varied results.

This isn't a bug, just a product of the design.

bamccaig
FMC said:

@bamccaig: I knew about site: but there are still cases in which it doesn't work.
Try this. (fifth result)

Search query said:

"diablo 3" "megaupload.com” OR "filecoast.com/?file=" \ OR “rapidshare.com” OR “bigupload.com/files/” OR “filecoast.com” \ OR “uploadmb.com/dw.php” OR “axifile.com/fi.php?” OR “badongo.com/file/” \ OR "bigfilez.com" OR “filehost.ro” -site:rapidlibrary.com -inurl:fileshunt

5th result said:

Diablo 3 trailer - Megaupload Search
Megaupload File Search Results: 1-25 of 42667 for "diablo 3 trailer" ...
http://www.megaupload.com/?d=G5MK..DG. Search tags: Disgaea Japanese Trailer ...
4megaupload.com/index.php?q=diablo+3+trailer - 58k - Cached - Similar pages - Note this

I think I fail to see what didn't work about it... :-/ Maybe the results have changed?

CGamesPlay

It might help if you used real quotes, rather than those stylized things.

bamccaig
CGamesPlay said:

It might help if you used real quotes, rather than those stylized things.

In case you were talking to me, I copied it from the linked page... :-X I was thinking the same thing though.
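
To make the quotes point concrete, here is a minimal Python sketch. It assumes that the `%93` and `%94` sequences in the encoded query from the first post are Windows-1252 bytes for the curly quotes; the helper name `straighten_quotes` is made up, and the `encoded` string is a short sample assembled from pieces of that query, not the full link.

```python
from urllib.parse import unquote_plus

# Map typographic double quotes to the plain ASCII double quote.
STRAIGHTEN = str.maketrans({"\u201c": '"', "\u201d": '"'})

def straighten_quotes(text):
    """Replace curly double quotes with straight ones (hypothetical helper)."""
    return text.translate(STRAIGHTEN)

# Sample built from pieces of the encoded query; %93 and %94 are assumed
# to be Windows-1252 bytes for the opening and closing curly quotes.
encoded = "%93rapidshare.com%94+OR+%22bigfilez.com%22"
decoded = unquote_plus(encoded, encoding="cp1252")

print(decoded)                     # still contains the curly quotes
print(straighten_quotes(decoded))  # only straight quotes remain
```

Whether Google treats the two kinds of quotes differently is not something this shows; it only makes the difference in the query text visible.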

FMC

@bamccaig: this is really weird, 10 minutes ago the result was the one you saw, now it's this (red highlight by me):
http://www.allegro.cc/files/attachment/596160

Quote:

Well, the way Google's architecture works means that your query goes to hundreds of separate computers, and some of them will have copies of the indexed sites you're looking for, and some of them won't. The most commonly used results are cached all over the place, but doing specific searches will return varied results.

This isn't a bug, just a product of the design.

The bug I was pointing at is that I asked NOT to get results with a certain word in the URL, and I still get them.
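
One way to check for this, or to work around it on the client side, is to filter the returned URLs again after the search. The following is only a sketch in Python: the function names are hypothetical, the plain substring test is an assumption about what -inurl: is meant to do, and the second sample URL is invented for the example.

```python
def violates_exclusions(url, excluded_words):
    """Return True if any excluded word appears anywhere in the URL."""
    lowered = url.lower()
    return any(word.lower() in lowered for word in excluded_words)

def filter_results(urls, excluded_words):
    """Keep only the URLs that contain none of the excluded words."""
    return [u for u in urls if not violates_exclusions(u, excluded_words)]

results = [
    "http://4megaupload.com/index.php?q=diablo+3+trailer",
    "http://rapidlibrary.com/example-page",  # invented URL, for illustration
]
kept = filter_results(results, ["rapidlibrary", "fileshunt"])
print(kept)
```

Any result dropped by `filter_results` is one that slipped past the exclusions in the query itself.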

Quote:

It might help if you used real quotes, rather than those stylized things.

"diablo 3" site:megaupload.com OR inurl:"filecoast.com/?file=" OR site:rapidshare.com OR inurl:"bigupload.com/files/" OR site:filecoast.com OR inurl:"uploadmb.com/dw.php" OR inurl:"axifile.com/fi.php?" OR inurl:"badongo.com/file/" OR "bigfilez.com" OR site:filehost.ro [google.com]

Except your query only returns results FROM one of those sites, which never was my goal.

I said:

I'm doing a specific search on all sites about a certain something which link to rapidshare (et similia)

I'm interested in the sites which link to the file hosting portals, not the portals themselves.

And I do know about the link: tool, but I prefer not to use it, so that I can also catch malformed URLs and broken links.

CGamesPlay
Quote:

Except your query only returns results FROM one of those sites, which never was my goal.

Does it? I just did it and the first page was 4 distinct sites.

Still, it isn't what you want, you're right.

bamccaig
FMC said:

@bamccaig: this is really weird, 10 minutes ago the result was the one you saw, now it's this (red highlight by me)...

That looks like the 4th result to me. :-X At least now I see what you're saying. :) Google, WTF!? >:(

(I don't mean it, Google. I still love you... We're probably just missing something)

FMC

@bamccaig: it was fifth this afternoon, it apparently changed a couple times.

Unfortunately it may just be an internal inconsistency, and it isn't the only one: http://www.searchengineshowdown.com/features/google/inconsistent.shtml :(

BAF

Well, that page you linked is from 2003, and I believe only some of those quirks still exist these days.

FMC
Quote:

Well, that page you linked is from 2003, and I believe only some of those quirks still exist these days.

Actually it seems most of them are still there.
But clearly -inurl: is bugged, and I'm afraid there is nothing I can do to make it work.

Jeff Bernard

If it's just a matter of searching rapidshare and megaupload, there are websites that'll search for links for you.

Just google rapidsearch or 4megaupload.

FMC
Quote:

If it's just a matter of searching rapidshare and megaupload, there are websites that'll search for links for you.

Just google rapidsearch or 4megaupload.

Thanks, but I have more fun fiddling with Google directly. :)

Thread #597413. Printed from Allegro.cc