Darwine ontwikkelt websites waar mensen met elkaar kennis maken of kennis met elkaar delen: mooi maatwerk.

We tonen u graag een aantal van onze succesvolle projecten.

link Referenties
link Route

ADRES
's-Gravenweg 190
3062 ZL Rotterdam
KvK 24348991 0000

tel 010-2133996
mob (Inne) 06-42804208
info@darwine.nl

<  >  naar bron Data Mining on the Internet with Google


bron artikelGoogle has quickly become one of the most well known words in the world and is used by millions daily, including myself. In an advanced database class back in university, we spent a couple of weeks studying the inner workings of search engines, and one topic which happened to come up was data mining using Google. Much to my surprise, out of a class of 80 fourth year computer engineers maybe four or five knew how to use Google to perform any sort of advanced queries.

Google (and many other search engines) has the ability not only to search on keywords, but also using a more “database-ish” query language to really narrow down your search results. Below is a summary of a few of the most useful lesser known features.


Lees verder...
Gepubliceerd door Matt op CyberWyre door


• KENNIS DELEN •
2006-06-08, door
Inne ten Have



Voyeur Heaven: Finding Unprotected Video And More 
We've all got a little voyeurism in us. That's why the recent article, Clearing Google Search History to Maintain Your Privacy sent my visitor counts off the charts

In this article, I'm going to show you how to create search queries that will list the contents of unprotected directories on the Internet.

You'll be able to play the music files, watch the videos, look at photos and more. I have to say, it's really addicting.

First of all, I want to say that I got the idea for this from reading this article and this article on Tech-Recipes.com. It's similar but I've improved on it a bit to make it more accurate so you don't have to wade through so many unrelated search listings.



2006-06-08, door Inne ten Have

You Are What You Search: AOL's data leak reveals the seven ways people search the Web 
AOL researchers recently published the search logs of about 650,000 members—a total of 36,389,629 individual searches. AOL's search nerds intended the files to be an academic resource but didn't consider that users might be peeved to see their private queries become a research tool. Last weekend, the Internet service provider tried to pull back the data, but by that point it had leaked all over the Web. If you've ever wanted to see what other people type into search boxes, now's your chance.

The search records don't include users' names, but each search is tagged with a number that's tied to a specific AOL account. The New York Times quickly sussed out that AOL Searcher No. 4417749 was 62-year-old Thelma Arnold. Indeed, Arnold has a "dog who urinate on everything," just as she'd typed into the search box. Valleywag has become one of many clearinghouses for funny, bizarre, and painful user profiles. The searches of AOL user No. 672368, for example, morphed over several weeks from "you're pregnant he doesn't want the baby" to "foods to eat when pregnant" to "abortion clinics charlotte nc" to "can christians be forgiven for abortion."

While these case studies are good voyeuristic fodder, snooping through one user's life barely scratches the surface of this data trove. The startup company I work for, Splunk, makes software to search computer-generated log files. AOL's 36 million log entries might look like an Orwellian nightmare to you, but for us it's a user transaction case study to die for. Using the third-party site splunkd.com, I've parsed the AOL data to create a typology of AOL Search users. Which of the seven types of searcher are you?
  • The Pornhound
  • The Manhunter
  • The Shopper
  • The Obsessive
  • The Omnivore
  • The Newbie
  • The Basket Case
Lees verder...
Gepubliceerd op Slate.com
2006-08-15, door Inne ten Have


GEEF ZELF COMMENTAAR OP DIT ARTIKEL

DOORZOEK WEBLOG



In het archief staan alle publicaties geordend op datum.
LAATSTE COMMENTAREN
MEESTE COMMENTAREN OP...