我非常确定应该有一种方法可以使用 MyISAM 表中的全文索引来搜索主题标签。默认设置将执行以下操作:
textfield
hashtag
#hashtag
#two #hashtag #hashtag
SELECT * FROM table WHERE MATCH(textfield) AGAINST ('#hashtag')
> | hashtag |
> | #hashtag |
> | #two #hashtag #hashtag |
虽然它应该只返回第二行和第三行。看起来主题标签被视为单词分隔符,因此在搜索开始之前它被“删除”。我应该怎样做才能启用索引并搜索包含 #
作为单词一部分的术语?
最佳答案
如 Fine-Tuning MySQL Full-Text Search 下所述:
You can change the set of characters that are considered word characters in several ways, as described in the following list. After making the modification, rebuild the indexes for each table that contains any
FULLTEXT
indexes. Suppose that you want to treat the hyphen character ('-'
) as a word character. Use one of these methods:
Modify the MySQL source: In
storage/myisam/ftdefs.h
, see thetrue_word_char()
andmisc_word_char()
macros. Add'-'
to one of those macros and recompile MySQL.Modify a character set file: This requires no recompilation. The
true_word_char()
macro uses a “character type” table to distinguish letters and numbers from other characters. . You can edit the contents of the<ctype><map>
array in one of the character set XML files to specify that'-'
is a “letter.” Then use the given character set for yourFULLTEXT
indexes. For information about the<ctype><map>
array format, see Section 10.3.1, “Character Definition Arrays”.Add a new collation for the character set used by the indexed columns, and alter the columns to use that collation. For general information about adding collations, see Section 10.4, “Adding a Collation to a Character Set”. For an example specific to full-text indexing, see Section 12.9.7, “Adding a Collation for Full-Text Indexing”.
关于MySQL 全文搜索主题标签(包括索引中的 # 符号),我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/21296870/