Thursday, October 11, 2012

Interact Intranet: Noise Words


To improve processing time for search queries it is common for applications and databases to ignore certain words which have low value.  SQL Server has it's own list of noise words which you can edit and change.
Interact also has a list however I've been told they are hard-coded into the system and not customizable.
Here is the list of Interact's noise words:

1 2 3 4 5 6 7 8 9 0 a b c d e f g h i j k l m n o p q r s t u v w x y z not about after all also an and another any are as at be because been before being between both but by came can come could did do each for from get got has had he have her here him himself his how if in into is it like make many me might more most much must my never now of on only or other our out over said same see should since some still such take than that the their them then there these they this those through to too under up very was way we well were what where which while who with would you your & ? use

Curiously if you look at the Interact database you'll notice there is a table called NoiseWords which has a single, unpopulated, column called WRD.

Reference (login required): http://extranet.interact-intranet.com/Interact/Pages/Content/Document.aspx?id=4609

No comments:

Post a Comment