Content of final catalogue

a_reader2003 carolynwhite2 at aol.com
Fri Jul 30 13:57:55 UTC 2004


Carolyn:
Now we have been using the reject codes for a while, what do
people think about amalgamating some of the categories?

David:
I think the answer depends on how the reject category is used.

Rejected posts are not, of course, actually deleted from the
database. We have the choice as to whether they are visible to
users.

If they are to be completely hidden, then it matters very little,
IMO, whether they are lumped together or not.


Carolyn:
As mentioned before, I think we do need to have them sorted to some 
extent, for two reasons:
- an 'audit trail', so we can find the reason a post was not included 
in the catalogue
- some reject categories may interest people for further searching 
and review - movies, fanfic and perhaps OT are the most obvious
- some reject categories - FAQ/adds nothing new will be used for 
other purposes subsequently

David:
An alternative, however, is to give users the option of looking at
them. To my mind this has the advantage that we can then present
the catalogue as a true alternative to the Yahoo search function: if 
a group member wants to call up *every* post with some property (the 
obvious property is that they - or a friend - are the author; in the 
days when the search went through 1500 posts at a go, I used to 
systematically look for posts by favoured posters, no matter how OT) 
then they can.

(snip)

Carolyn:

Well, if the final decision is to put categories such 
as 'Movie', 'OT', 'Fanfic' up as part of the catalogue, I imagine it 
would be possible to have a universal search routine which enabled 
you to sort them by author perhaps - this would be a useful search 
option across the whole catalogue, I agree. However, if you wanted 
any further refinement than this, eg any kind of subject 
categorisation within a category like 'Movie', there would be no 
alternative but to go in there and code up the movie posts. We've 
made a decision not to do this right now, but someone who is 
interested could maybe do this at some later date if they wanted to.

David:
I confess part of my thinking here is that, given we have a database 
of the entire list here, why not use it? It would be great to be able 
to call up all posts that, say, include the text
string 'Hermione' and are categorised as 'Snape'. This would give a
different (but overlapping) set to those categorised as 'Hermione'
and including the string 'Snape'. What about all posts
categorised 'Snape' but *not* including the text string 'vampire'?
Or all such posts whose author was Pippin, posted between GOF
release and OOP release. Really, I think you are sitting on
something of a goldmine here, and to focus only on the categories is 
to miss out.

Carolyn:
This is exactly what I am proposing - see the MEG document. The only 
difference is that, right now, I am not proposing to include any of 
the posts we are currently deciding are rejects for whatever reason.

There is infinite flexibility to the search routines that we could 
devise to work on the categorised posts. 

David:
In a sense I think there are two different visions here: to have a
high-quality selection (the 'present to JKR' vision) and to have a
way to defeat Yahoomort's obstructionist 'search' facilities (I'm
inclined to agree with Kelley that this is all about an eventual
plan by Yahoo to 'incentivise' us to pay for a slightly-better-than-
just-barely-acceptable service). I think both visions are great, and -
 aside from possibly competing for Paul's valuable time - are not at 
all in conflict. My feeling is that *technically* it's all quite 
doable.

Carolyn:
Certainly this catalogue would defeat any such plan from Yahoo - it 
will provide a much better search mechanism than anything they are 
prepared to provide.

However, you have not persuaded me of the value of including every 
post in the catalogue at this stage.

My pragmatic perspective is that even with 13 people having access to 
the database at present, progress is still extremely slow. If we 
start trying to code up the 70% of posts that are currently being 
rejected, we will frankly never finish the project. Any refinements 
are possible down the line, but right now I'd like to try and reach 
Base Camp 1, which I have suggested is getting up to post 50 000, and 
getting some kind of prototype search routine working on that body of 
posts.

Carolyn










More information about the HPFGU-Catalogue archive