Warning: Can't synchronize with repository "(default)" (/home/git/ome.git does not appear to be a Git repository.). Look in the Trac log for more information.
Notice: In order to edit this ticket you need to be either: a Product Owner, The owner or the reporter of the ticket, or, in case of a Task not yet assigned, a team_member"

User Story #1011 (new)

Opened 16 years ago

Last modified 14 years ago

Consider a field-name delegating analyzer

Reported by: jamoore Owned by: jamoore
Priority: major Milestone: Unscheduled
Component: Search Keywords: analyzer, search
Cc: wmoore Story Points: n.a.
Sprint: n.a. Importance: n.a.
Total Remaining Time: n.a. Estimated Remaining Time: n.a.

Description

The simple analyzer introduced in #1010 can possibly lead to confusion. From Will:

> If my file is called biology-data/CSFV-GFP01_3_R3D and I search for
> Name: "GFP*" it won't be found. (the * is a wild card).
>
> This is because, currently, CSFV-GFP01 is treated as a single
> "token".
> 
> So, if we "tokenise" using "-" , then the search index will have
> the words "GFP" and "CSFV", and not GFP-CSFV. If I search with
> "GFP" it will find this image.
> 
> However, if I search for "CSFV-GFP", then this search string will
> also be tokenised the same way, so I'll be searching for "CSFV" and
> "GFP" separately. I guess this will find the file OK, but it won't
> distinguish between a file named biology-data/CSFV-GFP01_3_R3D and
> CSFV-data/H2B-GFP01_3_R3D since both have "CSFV" and "GFP" in the
> name. Although it may? favour the former, based on the proximity of
> the words?
> 
> Maybe, in the grand scheme of things, this is not a problem.
>
>
> One question I have is whether you can have different tokenisers
> for the file name and for text annotations (comments, descriptions
> etc). Ideally, if I've got CSFV-GFP in a text description, I'd like
> to be able to find it by searching for "CSFV-GFP". (When using this
> search string for a text annotation search, it would have to NOT be
> tokenised on "-"). So, if you have more than one tokeniser for
> indexing, you'd have to have more than one for searching different
> fields.

Change History (2)

comment:1 Changed 15 years ago by jmoore

  • Milestone changed from OMERO-Beta4 to GatherReqs

Haven't seen a clear need for this yet.

comment:2 Changed 14 years ago by jmoore

  • Milestone changed from GatherReqs to Unscheduled

See #3164

Note: See TracTickets for help on using tickets. You may also have a look at Agilo extensions to the ticket.

1.3.13-PRO © 2008-2011 Agilo Software all rights reserved (this page was served in: 0.66238 sec.)

We're Hiring!