public inbox for gentoo-dev@lists.gentoo.org
 help / color / mirror / Atom feed
From: Thomas de Grenier de Latour <degrenier@easyconnect.fr>
To: gentoo-dev@lists.gentoo.org
Subject: Re: [gentoo-dev] opinion on how to improve the website redesign
Date: Tue, 22 Nov 2005 23:59:57 +0100	[thread overview]
Message-ID: <20051122235957.5b820699@eusebe> (raw)
In-Reply-To: <20051122175144.GD9086@gentoo.org>

On Tue, 22 Nov 2005 18:51:44 +0100
Sven Vermeulen <swift@gentoo.org> wrote:
>> A good start could be to do that the quick and ugly way, thanks
>> to Google (with some "site:www.gentoo.org/some/thing/" and other
>> black magic in the query terms).
> [...]

> - Google bases its search functionality on cached pages. 

Bah, yes, theory is that it's not 100% perfect, but in practice i
find it satisfying.

> - We would depend on Google a bit

Yes, but if other engines offer similar functionalities, in which
case it would just be a matter of changing the forms params names
and posting it elsewhere. But i don't know much about other public
search engines, so i have no idea about what kind of queries they
allow.

> Now Google might be a reliable web site/service, I'd rather have
> the search functionality of our web site implemented on the
> Gentoo infrastructure.

Sure, if that's doable in terms of workload and time to implement,
then it could be the best method.

My only concern would be on the choice of that engine: i mean,
i would still prefer Google over an internal engine which doesn't
allow mixing of exact strings and keywords in queries, or which
drops non-alpha chars, etc. I'm suffering enough with the forum's
one already :)

> - Restricting pages to /doc (documentation), /main (Gentoo
> information), /news (News items+GWN), /proj (project stuff)

Not a problem with google, that's the "/some/thing/" part of
the above cited fake query. I've put some real examples in the
proof-of-concept form i've posted about in an earlier message
somewhere else in that thread:
http://tdegreni.free.fr/gentoosearch/

> - Restricting languages (en, fr, ... and any combination)

Same as above for searching in a single language, adding some
"/fr/" to the base URL (or also possible using the lr=lang_fr
parameter, although it's less reliable). But for arbitrary
combinations, yes, that's probably a limitation (or a really ugly
query...).

What i've thought for i18n of the above JS code was to:
 - always at least propose search on the english pages
 - if user has defined in his browser a non-english preferred
language, also add some localised choices to the dropdown list.
(I'm not sure how to detect the user preferred lang from Javascript
though).

> - Have the search points assigned so that hits are calculated
> with certain weights:
>     * title's get most of the points, unless many titles are
> selected
>     * abstract's get the second most points, yada yada
>     * content get third most points

Here again, i think google is good enough for the needs, especially
if you target the search on some "/doc/en/" or alike sub-parts of
the website, which don't let that many pages anyway. I mean, i
often do that kind of searchs on the docs or the dev handbook with
a conquery plugin, and i don't remember having ever seen the page i
was looking for not beeing in the top 5 results. But yes, at least
in theory, a tweaked local engine could be even better.


Hmm... re-reading the above message, i realize i may sound like
some kind of google-zealot: so just to make it clear, i'm not, and i
would be pleased to see anything better implemented. It's really
just that i think it could do a rather good job and that using it is
easy enough to be a really short-term solution.

--
TGL.
-- 
gentoo-dev@gentoo.org mailing list



  parent reply	other threads:[~2005-11-22 23:00 UTC|newest]

Thread overview: 29+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-11-21 14:09 [gentoo-dev] opinion on how to improve the website redesign Mike Frysinger
2005-11-21 14:49 ` Thomas de Grenier de Latour
2005-11-21 14:57   ` Mike Frysinger
2005-11-21 15:19 ` Grant Goodyear
2005-11-21 17:24 ` Ciaran McCreesh
2005-11-21 18:21   ` Henrik Brix Andersen
2005-11-21 22:39   ` Luca Barbato
2005-11-21 22:47   ` Diego 'Flameeyes' Pettenò
2005-11-21 20:35 ` A. Khattri
2005-11-21 21:07   ` Chris Gianelloni
2005-11-22  0:32     ` Corey Shields
2005-11-22 11:16       ` Paul de Vrieze
2005-11-22  0:14 ` Grant Goodyear
2005-11-22  0:55   ` Mike Frysinger
2005-11-22  8:42   ` Luis F. Araujo
2005-11-22 14:09     ` Lance Albertson
2005-11-22 15:03       ` Mike Frysinger
2005-11-22 17:06         ` [gentoo-dev] " Duncan
2005-11-22  0:53 ` [gentoo-dev] " Ingo Bormuth
2005-11-22  1:11   ` Mike Frysinger
2005-11-22  2:53     ` Thomas de Grenier de Latour
2005-11-22  3:05       ` Georgi Georgiev
2005-11-22  3:08         ` Georgi Georgiev
2005-11-22  5:04           ` Mike Frysinger
2005-11-22 18:45             ` Thomas de Grenier de Latour
2005-11-22 17:51       ` Sven Vermeulen
2005-11-22 21:34         ` Mike Frysinger
2005-11-22 22:59         ` Thomas de Grenier de Latour [this message]
2005-11-22 11:13 ` Paul de Vrieze

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20051122235957.5b820699@eusebe \
    --to=degrenier@easyconnect.fr \
    --cc=gentoo-dev@lists.gentoo.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox