Wo finde ich die MySQL Stopword-Liste?

Irgendwelche Probleme mit XAMPP für Linux? Dann ist hier genau der richtige Ort um nachzufragen.

Wo finde ich die MySQL Stopword-Liste?

Postby Maurizio » 02. November 2005 13:21

Hallo zusammen,

ich bin nun schon geraume Zeit auf Suche nach der Stopword-Liste für MySQL, welche laut mysql.com in der Datei ft_static.c definiert ist. Leider finde ich diese jedoch nicht in XAMPP.

Siehe auch:
http://dev.mysql.com/doc/refman/4.0/de/ ... uning.html

Kann mir jemand sagen, wo bzw. wie ich an die Stopwords komme?

Vielen Dank für die Hilfe,

Maurizio
Maurizio
 
Posts: 11
Joined: 04. February 2004 14:10

Postby Wiedmann » 02. November 2005 13:45

Code: Select all
const char *ft_precompiled_stopwords[] = {

#ifdef COMPILE_STOPWORDS_IN

/* This particular stopword list was taken from SMART distribution
   ftp://ftp.cs.cornell.edu/pub/smart/smart.11.0.tar.Z
   it was slightly modified to my taste, though
 */

  "a's",
  "able",
  "about",
  "above",
  "according",
  "accordingly",
  "across",
  "actually",
  "after",
  "afterwards",
  "again",
  "against",
  "ain't",
  "all",
  "allow",
  "allows",
  "almost",
  "alone",
  "along",
  "already",
  "also",
  "although",
  "always",
  "am",
  "among",
  "amongst",
  "an",
  "and",
  "another",
  "any",
  "anybody",
  "anyhow",
  "anyone",
  "anything",
  "anyway",
  "anyways",
  "anywhere",
  "apart",
  "appear",
  "appreciate",
  "appropriate",
  "are",
  "aren't",
  "around",
  "as",
  "aside",
  "ask",
  "asking",
  "associated",
  "at",
  "available",
  "away",
  "awfully",
  "be",
  "became",
  "because",
  "become",
  "becomes",
  "becoming",
  "been",
  "before",
  "beforehand",
  "behind",
  "being",
  "believe",
  "below",
  "beside",
  "besides",
  "best",
  "better",
  "between",
  "beyond",
  "both",
  "brief",
  "but",
  "by",
  "c'mon",
  "c's",
  "came",
  "can",
  "can't",
  "cannot",
  "cant",
  "cause",
  "causes",
  "certain",
  "certainly",
  "changes",
  "clearly",
  "co",
  "com",
  "come",
  "comes",
  "concerning",
  "consequently",
  "consider",
  "considering",
  "contain",
  "containing",
  "contains",
  "corresponding",
  "could",
  "couldn't",
  "course",
  "currently",
  "definitely",
  "described",
  "despite",
  "did",
  "didn't",
  "different",
  "do",
  "does",
  "doesn't",
  "doing",
  "don't",
  "done",
  "down",
  "downwards",
  "during",
  "each",
  "edu",
  "eg",
  "eight",
  "either",
  "else",
  "elsewhere",
  "enough",
  "entirely",
  "especially",
  "et",
  "etc",
  "even",
  "ever",
  "every",
  "everybody",
  "everyone",
  "everything",
  "everywhere",
  "ex",
  "exactly",
  "example",
  "except",
  "far",
  "few",
  "fifth",
  "first",
  "five",
  "followed",
  "following",
  "follows",
  "for",
  "former",
  "formerly",
  "forth",
  "four",
  "from",
  "further",
  "furthermore",
  "get",
  "gets",
  "getting",
  "given",
  "gives",
  "go",
  "goes",
  "going",
  "gone",
  "got",
  "gotten",
  "greetings",
  "had",
  "hadn't",
  "happens",
  "hardly",
  "has",
  "hasn't",
  "have",
  "haven't",
  "having",
  "he",
  "he's",
  "hello",
  "help",
  "hence",
  "her",
  "here",
  "here's",
  "hereafter",
  "hereby",
  "herein",
  "hereupon",
  "hers",
  "herself",
  "hi",
  "him",
  "himself",
  "his",
  "hither",
  "hopefully",
  "how",
  "howbeit",
  "however",
  "i'd",
  "i'll",
  "i'm",
  "i've",
  "ie",
  "if",
  "ignored",
  "immediate",
  "in",
  "inasmuch",
  "inc",
  "indeed",
  "indicate",
  "indicated",
  "indicates",
  "inner",
  "insofar",
  "instead",
  "into",
  "inward",
  "is",
  "isn't",
  "it",
  "it'd",
  "it'll",
  "it's",
  "its",
  "itself",
  "just",
  "keep",
  "keeps",
  "kept",
  "know",
  "knows",
  "known",
  "last",
  "lately",
  "later",
  "latter",
  "latterly",
  "least",
  "less",
  "lest",
  "let",
  "let's",
  "like",
  "liked",
  "likely",
  "little",
  "look",
  "looking",
  "looks",
  "ltd",
  "mainly",
  "many",
  "may",
  "maybe",
  "me",
  "mean",
  "meanwhile",
  "merely",
  "might",
  "more",
  "moreover",
  "most",
  "mostly",
  "much",
  "must",
  "my",
  "myself",
  "name",
  "namely",
  "nd",
  "near",
  "nearly",
  "necessary",
  "need",
  "needs",
  "neither",
  "never",
  "nevertheless",
  "new",
  "next",
  "nine",
  "no",
  "nobody",
  "non",
  "none",
  "noone",
  "nor",
  "normally",
  "not",
  "nothing",
  "novel",
  "now",
  "nowhere",
  "obviously",
  "of",
  "off",
  "often",
  "oh",
  "ok",
  "okay",
  "old",
  "on",
  "once",
  "one",
  "ones",
  "only",
  "onto",
  "or",
  "other",
  "others",
  "otherwise",
  "ought",
  "our",
  "ours",
  "ourselves",
  "out",
  "outside",
  "over",
  "overall",
  "own",
  "particular",
  "particularly",
  "per",
  "perhaps",
  "placed",
  "please",
  "plus",
  "possible",
  "presumably",
  "probably",
  "provides",
  "que",
  "quite",
  "qv",
  "rather",
  "rd",
  "re",
  "really",
  "reasonably",
  "regarding",
  "regardless",
  "regards",
  "relatively",
  "respectively",
  "right",
  "said",
  "same",
  "saw",
  "say",
  "saying",
  "says",
  "second",
  "secondly",
  "see",
  "seeing",
  "seem",
  "seemed",
  "seeming",
  "seems",
  "seen",
  "self",
  "selves",
  "sensible",
  "sent",
  "serious",
  "seriously",
  "seven",
  "several",
  "shall",
  "she",
  "should",
  "shouldn't",
  "since",
  "six",
  "so",
  "some",
  "somebody",
  "somehow",
  "someone",
  "something",
  "sometime",
  "sometimes",
  "somewhat",
  "somewhere",
  "soon",
  "sorry",
  "specified",
  "specify",
  "specifying",
  "still",
  "sub",
  "such",
  "sup",
  "sure",
  "t's",
  "take",
  "taken",
  "tell",
  "tends",
  "th",
  "than",
  "thank",
  "thanks",
  "thanx",
  "that",
  "that's",
  "thats",
  "the",
  "their",
  "theirs",
  "them",
  "themselves",
  "then",
  "thence",
  "there",
  "there's",
  "thereafter",
  "thereby",
  "therefore",
  "therein",
  "theres",
  "thereupon",
  "these",
  "they",
  "they'd",
  "they'll",
  "they're",
  "they've",
  "think",
  "third",
  "this",
  "thorough",
  "thoroughly",
  "those",
  "though",
  "three",
  "through",
  "throughout",
  "thru",
  "thus",
  "to",
  "together",
  "too",
  "took",
  "toward",
  "towards",
  "tried",
  "tries",
  "truly",
  "try",
  "trying",
  "twice",
  "two",
  "un",
  "under",
  "unfortunately",
  "unless",
  "unlikely",
  "until",
  "unto",
  "up",
  "upon",
  "us",
  "use",
  "used",
  "useful",
  "uses",
  "using",
  "usually",
  "value",
  "various",
  "very",
  "via",
  "viz",
  "vs",
  "want",
  "wants",
  "was",
  "wasn't",
  "way",
  "we",
  "we'd",
  "we'll",
  "we're",
  "we've",
  "welcome",
  "well",
  "went",
  "were",
  "weren't",
  "what",
  "what's",
  "whatever",
  "when",
  "whence",
  "whenever",
  "where",
  "where's",
  "whereafter",
  "whereas",
  "whereby",
  "wherein",
  "whereupon",
  "wherever",
  "whether",
  "which",
  "while",
  "whither",
  "who",
  "who's",
  "whoever",
  "whole",
  "whom",
  "whose",
  "why",
  "will",
  "willing",
  "wish",
  "with",
  "within",
  "without",
  "won't",
  "wonder",
  "would",
  "would",
  "wouldn't",
  "yes",
  "yet",
  "you",
  "you'd",
  "you'll",
  "you're",
  "you've",
  "your",
  "yours",
  "yourself",
  "yourselves",
  "zero",
#endif

  NULL };
Wiedmann
AF Moderator
 
Posts: 17102
Joined: 01. February 2004 12:38
Location: Stuttgart / Germany

Postby Maurizio » 02. November 2005 15:49

Danke, nur welche Datei muss ich editieren, um die Stopwords meinen Bedürfnissen anzupassen?
Maurizio
 
Posts: 11
Joined: 04. February 2004 14:10

Postby Wiedmann » 02. November 2005 19:15

welche Datei muss ich editieren, um die Stopwords meinen Bedürfnissen anzupassen?

Die Antwort auf deine Frage hast du doch in deinem ersten Post schon geschrieben? (Wird ja zudem in dem von dir geposteten Link erklärt).
Wiedmann
AF Moderator
 
Posts: 17102
Joined: 01. February 2004 12:38
Location: Stuttgart / Germany

Postby Maurizio » 03. November 2005 00:12

Erwähnt habe ich die ft_static.c. Ja bin ich denn blind? Ich kann diese Datei nirgends finden :roll:
Maurizio
 
Posts: 11
Joined: 04. February 2004 14:10

Postby Wiedmann » 03. November 2005 00:20

Ich kann diese Datei nirgends finden

Durchaus möglich. Je nach dem wo du suchst...

Hast du denn deinen Link auch richtig durchgelesen?
Wiedmann
AF Moderator
 
Posts: 17102
Joined: 01. February 2004 12:38
Location: Stuttgart / Germany

Postby Maurizio » 03. November 2005 11:16

Da will mich jetzt aber jemand fordern :wink:

Liegt es daran, dass die LAMPP Version keine original Quelldistribution ist?

Ich hatte schonmal das Problem, das ich die zu indexierende Wortlänge definieren musste. Da ich die ftdefs.h-Datei ebenfalls nicht finden konnte, habe ich seinerzeit in der [mysqld] Sektion der Datei "my.cnf" eine Variable (set-variable=ft_min_word_len=X) eingefügt. Dies funktioniert seitdem, war allerdings auch nicht wie im genannten mysql.com-Link beschrieben.
Maurizio
 
Posts: 11
Joined: 04. February 2004 14:10

Postby Wiedmann » 03. November 2005 11:35

Liegt es daran, dass die LAMPP Version keine original Quelldistribution ist?

Hast du was kompilieren müssen bei der XAMPP-Installation?

Ich hatte schonmal das Problem, das ich die zu indexierende Wortlänge definieren musste. Da ich die ftdefs.h-Datei ebenfalls nicht finden konnte, habe ich seinerzeit in der [mysqld] Sektion der Datei "my.cnf" eine Variable (set-variable=ft_min_word_len=X) eingefügt.

Das muss dann aber zu Zeiten pre 4.0 gewesen sein...

Dies funktioniert seitdem, war allerdings auch nicht wie im genannten mysql.com-Link beschrieben.

Also ich kann das dort nachlesen...

BTW seh ich grad, dass ich mir das Posten der Stopwortliste hätte sparen können. Die gibt es dort ja direkt zum Anschauen.
Wiedmann
AF Moderator
 
Posts: 17102
Joined: 01. February 2004 12:38
Location: Stuttgart / Germany

Postby Maurizio » 03. November 2005 12:16

Wiedmann wrote:Hast du was kompilieren müssen bei der XAMPP-Installation?

Nein, natürlich nicht.

Das muss dann aber zu Zeiten pre 4.0 gewesen sein...

Nein, erst neulich in Version 1.4.13, MySQL 4.1.11

Also ich kann das dort nachlesen...

Wo kannst Du das mit der my.cnf nachlesen?

BTW seh ich grad, dass ich mir das Posten der Stopwortliste hätte sparen können. Die gibt es dort ja direkt zum Anschauen.

Sorry, aber ich sehe dort auch keine Stopwortliste...

Reden wir vielleicht aneinander vorbei? Ich suche die Stopwortliste in LAMPP Version 1.4.13, MySQL 4.1.11. Ich weiss es ja nicht, aber ist genannte Liste erst in neuere LAMPP Version "eingebaut" worden?
Maurizio
 
Posts: 11
Joined: 04. February 2004 14:10

Postby Wiedmann » 03. November 2005 12:39

Maurizio wrote:
Wiedmann wrote:
Maurizio wrote:...eine Variable (set-variable=ft_min_word_len=X)...
Dies funktioniert seitdem, war allerdings auch nicht wie im genannten mysql.com-Link beschrieben.

Also ich kann das dort nachlesen...

Wo kannst Du das mit der my.cnf nachlesen?

Dort: http://dev.mysql.com/doc/refman/4.1/en/ ... uning.html
Wir reden nach wie vor von der offiziellen Doku wo alles drin steht...

Sorry, aber ich sehe dort auch keine Stopwortliste...

Dort, genau 2 Menüpunkte höher (und immer noch unter dem Kapitel Full-Text): http://dev.mysql.com/doc/refman/4.1/en/ ... words.html

Ich weiss es ja nicht, aber ist genannte Liste erst in neuere LAMPP Version "eingebaut" worden?

Nein, die ist integraler Bestandteil von MySQL und ist schon immer mit einkompiliert.

Reden wir vielleicht aneinander vorbei?

Nicht wirklich. Du musst nur genau lesen was da steht auf dieser Manualseite. Würdest du zudem noch das Manual das passend zu deiner Serverversion ist benutzen (fällt mir ja jetzt erst auf...), würdest du sogar sehen dass du richtig Glück hast.
Wiedmann
AF Moderator
 
Posts: 17102
Joined: 01. February 2004 12:38
Location: Stuttgart / Germany

Postby Maurizio » 03. November 2005 13:25

Wiedmann wrote:Nein, die ist integraler Bestandteil von MySQL und ist schon immer mit einkompiliert.


Dann nenne mir doch bitte mal den Pfad dahin bzw. den Namen der Datei. Laut Manual heisst das File "ft_static.c", welches ich absolut nicht finden kann. Ich komme hier im Moment, auch mit Manual, absolut nicht weiter :roll:

Edit: Ich sehe das im Moment so: Ohne MySQL-Sourcen keine ft_static.c. Mit dem reinen LAMPP-Paket: keine Anpassung der Stopwords möglich.
Maurizio
 
Posts: 11
Joined: 04. February 2004 14:10


Return to XAMPP für Linux

Who is online

Users browsing this forum: No registered users and 19 guests