Go Back   Small Business Forum > Online Business Discussion > Internet Marketing

Reply
  #1 (permalink)  
Old 05-31-2005, 01:38 AM
Matt Probert
Guest
 
Posts: n/a
Stop Words - Lists available

Recently on webmaster world someone was asking for a list of stop
words, and upon checking Google I couldn't find any available so I
have released our lists for free use. Should anyone require them.

There are several lists:

Prepositions, Adjectives, Adverbs and Verbs, all available as separate
ASCII lists (ZIPed up), and sorted into alphabetical order.

These can be downloaded, along with other free lists, from:

http://www.probertencyclopaedia.com/xfree.htm

Matt

--
The Probert Encyclopaedia - Beyond Britannica
http://www.probertencyclopaedia.com
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #2 (permalink)  
Old 05-31-2005, 01:38 AM
Nomen Nescio
Guest
 
Posts: n/a
Re: Stop Words - Lists available

-----BEGIN TYPE III ANONYMOUS MESSAGE-----
Message-type: plaintext

you wrote:

===
Recently on webmaster world someone was asking for a list of stop
words, and upon checking Google I couldn't find any available so I
have released our lists for free use. Should anyone require them.

There are several lists:

Prepositions, Adjectives, Adverbs and Verbs, all available as separate
ASCII lists (ZIPed up), and sorted into alphabetical order.

These can be downloaded, along with other free lists, from:

http://www.probertencyclopaedia.com/xfree.htm

Matt
=====

Are stop words relevant anymore? I read recently that google is now indexing everything..
including stop words


-----END TYPE III ANONYMOUS MESSAGE-----
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 05-31-2005, 01:38 AM
Borek
Guest
 
Posts: n/a
Re: Stop Words - Lists available

On Sun, 22 May 2005 22:10:05 +0200, Nomen Nescio <nobody@dizum.com> wrote:

> Recently on webmaster world someone was asking for a list of stop
> words, and upon checking Google I couldn't find any available so I


What ARE stop words in a first place?

Best,
Borek
--
http://www.chembuddy.com - chemical calculators for labs and education
BATE - Base Acid Titration and Equilibria
program for pH calculations
CASC - Concentration and Solution Calculator
program for solution preparation and concentration conversions
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #4 (permalink)  
Old 05-31-2005, 01:38 AM
Fritz M
Guest
 
Posts: n/a
Re: Stop Words - Lists available

Borek wrote:

> What ARE stop words in a first place?


Commonly used short words that search engines don't index.

http://www.google.com/search?q=define:Stop+Words

RFM

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #5 (permalink)  
Old 05-31-2005, 01:38 AM
anon-bounces@deuxpi.ca
Guest
 
Posts: n/a
Re: Re: Stop Words - Lists available

This is a Type III anonymous message, sent to you by the Mixminion
server at deuxpi.ca. If you do not want to receive anonymous messages,
please contact deuxpi-admin@deuxpi.ca. For more information about
anonymity, see http://mixminion.net.

-----BEGIN TYPE III ANONYMOUS MESSAGE-----
Message-type: plaintext

In <1116821071.207468.17850@g43g2000cwa.googlegroups. com> Fritz M <nospam@masoner.net> wrote:
>Borek wrote:
>
>> What ARE stop words in a first place?

>
>Commonly used short words that search engines don't index.
>
>http://www.google.com/search?q=define:Stop+Words
>
>RFM
>
>

except that google is now indexing everything...

http://www.google.com/search?q=and&s...utf-8&oe=utf-8

Results 1 - 10 of about 4,620,000,000 for and. (0.14 seconds)

the face of SE's are changing

- --
This message brought to you by a new mixminion GUI under development

-----END TYPE III ANONYMOUS MESSAGE-----
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #6 (permalink)  
Old 05-31-2005, 01:38 AM
Borek
Guest
 
Posts: n/a
Re: Stop Words - Lists available

On Mon, 23 May 2005 06:04:31 +0200, Fritz M <nospam@masoner.net> wrote:

>> What ARE stop words in a first place?

>
> Commonly used short words that search engines don't index.
>
> http://www.google.com/search?q=define:Stop+Words


Thnx. I didn't know they have a name even if I was aware of their
existence.

Best,
Borek
--
http://www.chembuddy.com - chemical calculators for labs and education
BATE - Base Acid Titration and Equilibria
program for pH calculations
CASC - Concentration and Solution Calculator
program for solution preparation and concentration conversions
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #7 (permalink)  
Old 05-31-2005, 01:38 AM
Justin Sane
Guest
 
Posts: n/a
Re: Stop Words - Lists available

> These can be downloaded, along with other free lists, from:
>
> http://www.probertencyclopaedia.com/xfree.htm


What should we do with these lists of words? :|

--
Thanks,

Justin.
http://www.auriance.com - http://www.auriance.net
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #8 (permalink)  
Old 05-31-2005, 01:38 AM
Fritz M
Guest
 
Posts: n/a
Re: Stop Words - Lists available


Justin Sane wrote:

> What should we do with these lists of words? :|


Even though they're now being indexed, you probably don't want to
select something with 4 trillion hits as an important keyword for your
site :-)

RFM

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #9 (permalink)  
Old 05-31-2005, 01:38 AM
Stacey
Guest
 
Posts: n/a
Re: Stop Words - Lists available

"Fritz M" <nospam@masoner.net> wrote in message
news:1116864688.929030.206850@g49g2000cwa.googlegr oups.com...
>
> Justin Sane wrote:
>
>> What should we do with these lists of words? :|

>
> Even though they're now being indexed, you probably don't want to
> select something with 4 trillion hits as an important keyword for your
> site :-)



Google isn't using the words in search queries like "football and baseball".
http://www.google.com/search?hl=en&l...l+and+baseball .
It states the *and* operator isn't necessary. Of course if you use just
*and* it will list results.

Stacey


Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #10 (permalink)  
Old 05-31-2005, 01:38 AM
Nomen Nescio
Guest
 
Posts: n/a
Re: Re: Stop Words - Lists available

-----BEGIN TYPE III ANONYMOUS MESSAGE-----
Message-type: plaintext

In <9Gnke.1$8A4.0@lakeread07> "Stacey" <http://www.staceyssimplestuff.com> wrote:
>Fritz M
><nospam@masoner.net> wrote in message
>news:1116864688.929030.206850@g49g2000cwa.googleg roups.com...
>>
>> Justin Sane wrote:
>>
>>> What should we do with these lists of words? :|

>>
>> Even though they're now being indexed

>you probably don't want to
>> select something with 4 trillion hits as an important keyword for your
>> site :-)

>
>
>Google isn't using the words in search queries like "football and baseball".
>http://www.google.com/search?hl=en&l...l+and+baseball .
>It states the *and* operator isn't necessary. Of course if you use just
>*and* it will list results.


Too bad. I would think that natural language searching would be pretty popular with the
"I barely know how to load a web page" crowd. and even with everybody else for casual
searches.


- --
This message brought to you by a new mixminion GUI under development

-----END TYPE III ANONYMOUS MESSAGE-----
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


All times are GMT -4. The time now is 10:43 PM.

 
         


Design by: vBulletin Skins Zone / MBA Forums
Powered by vBulletin Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO 3.0.0 RC5
smallbusinessforum.com

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30