The Golden rules of searching
"Effective searching"
(The following is being developed onto a subset of a
conference
at the canadian Recon 2006)
| The golden rules of searching | top |
Quaerite et invenietis
There are some basic rules for seekers. Of course things are different depending from the KIND of search you are
performing. Please note that different advice and techniques
apply for
long term web searching
versus
short term web searching. The following 'rules' represent
a general synthesis, that should
apply most of the times.
Almost every query can, and indeed should,
be subdivided into five
steps
think, find, refine, evaluate, collate.
(The Finder Reverses Every Corner)
-
THINK about your query
Seekers do not "plunge" into
a search out of the blue. Like artists, they visualize the correct result
before they begin. The 'perfect' answer is driving their queries. The perfect answer creates
the correct question(s)
What kind of results do you want? Books? Doctoral thesis? Images?
News? Biographies? How many results do you want? Three hundred pages of material?
One single authoritative book? A dozen pdf-articles?
A short and concise essay?
Obviously you cannot be an expert in all single field of
any and every query you will launch. But you must
be an expert in the field of finding the right resources for
each and every kind of query.
A seeker needs TWO skills:
to formulate a question correctly and to know where to look.
And this means knowing which
resources you should use for your searches. And this means you must first of all know how to search those very
resources you should use for your searches.
In fact each 'part' of the
web requires a different approach. For instance, searches on usenet, on
blogs or on ftp servers are
not ruled by the same
lore.
Also each kind of target, each quarry,
requires a different approach: for instance
when searching
news, images or books.
You must also decide if for a given query you will have to use
combing techniques like stalking,
luring or trolling.
Before even beginning, think about your query: prepare your question(s)
for the perfect result and decide which resources you will use.
-
FIND what you are looking for
Easier said than done, I know.
In fact this very complex step is at the same time the whole point of the exercise.
However, depending on the previous "thinking about your query" step, you will at least already know where you should be looking for and
what kind of techniques you'll have to use.
A general advice is to comb as much as you can, i.e.
use the knowledge that others have gathered, search those that have already searched,
and do not 'reinvent the wheel' at every query.
A second generally useful advice is to go 'regional' as much a you can, that is to use
information and resources that are located on the same plane (geographically, temporally, academically, conceptually) as
your quarry.
Anyway, if your question has been formulated
correctly and if you already know where to look, the 'finding' part will not
be too hard.
-
REFINE while searching
Your queries are usually either too wide or too narrow. Usually -in fact- they are too wide. If a subject is too wide,
as it is most of the time, you have to limit and narrow your search.
Using boolean operators (AND and NOT or + and -) will narrow the search adding and/or eliminating terms. You can also limit
your query temporally (for instance only 2005/2006), geographically
(for instance only .ru) or formally (for instance only .pdf
files).
These limits allow you to restrict results to items meeting specific criteria. I.e.:
a particular type (newspaper articles, journal articles, complete books, small snippets of text);
a particular language (English, German, Spanish, Russian, Italian, French, etc.);
a target published or produced within a particular time frame (2000-2004)
-
EVALUATE your results
This is easier said than done, again. The evaluation
phase is of paramount importance, but -alas- far from being simple.
Whatever you are looking for,
you are bound to find very good quality results, good quality results, average quality results and poor quality (or no quality at all) results.
This is not only due to the spam, but to the simple fact that the web allows anyone to publish anything he fancies.
A possible approach to evaluation is to use as a rough evaluation guide the seven
old classical questions: quis, quid, ubi, quibus
auxiliis, cur, quomodo,
quando: Who, What, Where, Helped by whom, Why, How and When.
Well, first of all you should maybe ask yourself why the heck you need to search for something at all :-)
Continue only if you have an answer to this fundamental question.
If you manage to answer the fundamental question and continue,
whenever you find a result, it is useful to ask yourself for evaluation purposes
the whole bag of classic questions. Does not take long and helps a lot.
Let's begin: quis WHO is the Author (and therefore, given his
biography,
what qualifies him to write about the matter at hand); quid WHAT is in fact the result you found
(a complete explanation,
a proof of concept, a small addition, an hypothetical solution...); check ubi WHERE you did find your result (look at the URL,
look at the server, look at the links pointing to it...); quibus
auxiliis, WHO helped the Author? (look at who OWNS the server hosting the result, look at eventual references, links, etc...);
and ask yourself cur: WHY the result has been produced and put on the web; quomodo HOW the result
has been produced (again, similar to quid/what: years of research or one half-afternoon sudden jerk?);
and finally
quando: WHEN was the result produced/published/updated, when was the web site created/updated.
(Archive org may prove
invaluable for such dating purposes. Note that you can also retrieve a site for specific date of the
past).
Simply answering the seven classical questions will already allow you to proceed towards a proper evaluation of a set of results.
Finally a word about those "ready-made" evaluations you can find on the web. Should you use them? Yep, Cum grano salis.
First of all there's a tendency to ignore "grey areas" of the web when evaluating targets. Some seem
fr instance to believe that
a pdf file should automagically and eo ipso be more worthy than an html file,
independently from its actual content.
No way:
a text is not worth
just because it has been printed and published in a book.
Its worthiness is always and only intrinsic.
Many ready-made 'evaluations' you can find on the web
are blinded by frills' bells and whistles
and tend to practicize an excessive 'formal-bowing' towards the limited 'academical'
subset. What's worse: many 'evaluators' are often utterly
incapable of judging content at face value, behind the formal aspect.
This IS a major problem: some of the most interesting lore on the web, some of the
most advanced internet techniques are discussed and developed in small circles
of geeks that may, or may not, use a clean and politically correct english, or even
know english well enough to satisfy evaluation purists.
It may also be worth noting that -in general- east european places (.ru,
.bg, .cz etc.) are (still)
"culturally" less commercially oriented and therefore
offer more "sound" valid evaluations of books/software/targets,
instead of the bogus fanbois "evaluations"
that are purposely planted on -say- amazon or ebay.
In fact you can hardly find a non-paid, sorry: non-biased, review
or comparison of certain products on our euroamerican side of the web.
-
COLLATE your results
Ok, you have performed a long search. Gathered tons of results. Painstakingly weeded out bogus and crap sites, understood which are the
most important authoritative results... and now you stop your search and go to sleep satisfied.
This is a serious mistake. A query is not finished when you have found your results. Most
will be lost if you don't COLLATE your results, squeezing the most authoritative
results into a coherent and valid interpretation. A 'conclusion' of sort.
Systematic record keeping is OF PARAMOUNT IMPORTANCE when searching. A classical mistake
is to 'forget' to keep records during complex searches.
For this purpose I suggest you simply use the NOTE function in Opera: just highlight the target
text you are interested in,
rightclick, and then choose copy to note (or use the keyboard shortcuts, either
CTRL+SHIFT+C or CTRL+ALT+E depending on the version of Opera you'r using):
*the URL* of the page you'r viewing at that moment *and the date* will be automatically stored in your note
*together with the highlighted text*.
You may want to create ad hoc note folders (for instance
"research_assembly_books_29MAY2006") and, at the
end of your search, before switching the computer off and go to sleep, just move all your
related notes inside some correctly named folders.
Opera's Notes are just text format, very easy to edit,
cat, search or prune.
Alternatively use something to take notes, even a pen and a sheet of paper will do. DO NOT rely on your memory alone (or on your
extraordinary seeking capabilities to re-find at once what you may have lost :-)
If you do, you will regret it. Sooner
than you believe.
Once you create some crumbs-paths of well kept records, collating the results will be
a quick and easy process.
- A final note about your "searching environment"
Listening to music while searching is NOT a good idea, chatting
while searching is NOT a good idea, being interrupted while refining a query is also unhelpful.
Always search & seek in a quiet and relaxed environment,
with as few disturbances as possible. No disturbing
music, no telephone, no skype, no email distractions, no IRC, no chat
(and of course no TV, duh).
A serene, calm atmosphere, will allow you to
take full advantage of your seeking efforts in an optimal way.
Serenity CREATES serendipity.
This does not have to mean soberness, austerity or ascetics, though.
If you fancy something to drink, have it ready before starting, and always chose excellent wines,
superb (i.e. belgian) beers or
the most finest teas.
Let's cut it short: Seekers should maybe
consider themselves monks of the early middle ages, sitting in their
peaceful cells, seeking old forgotten
knowledge, sipping good wines, while all those barbarians and zombies are burning everything in sight
- and torturing each other -
not far from our virtual abbey's walls.
Consider yourself a monk of the early middle ages among
dangerous barbarians
because this
is exactly what you are, both on the commercially enslaved web and in your commercially enslaved real life.

(c) III Millennium: [fravia+], all rights reserved, reversed, reviled and revealed