Postgresql full-text search extension for chinese
It implements by importing swcs.
- Downloads
git clone https://github.com/jaiminpan/pg_scws
- Compile
Make sure PostgreSQL is installed and command pg_config
could be runnable.
cd pg_scws
USE_PGXS=1 make && make install
scws=# create extension pg_scws;
CREATE EXTENSION
scws=# select * from to_tsvector('scwscfg', '小明硕士毕业于中国科学院计算所,后在日本京都大学深造');
to_tsvector
---------------------------------------------------------------------------
'中国科学院计算所':4 '小明':1 '日本京都大学':5 '毕业':3 '深造':6 '硕士':2
(1 row)
scws=# select * from to_tsvector('scwscfg', '李小福是创新办主任也是云计算方面的专家');
to_tsvector
---------------------------------------------------------------------------------
'专家':10 '主任':5 '云':7 '创新':3 '办':4 '方面':9 '是':2,6 '李小福':1 '计算':8
(1 row)
It should work with PostgreSQL > 9.x Now only tested with PostgreSQL 9.4
scws project when you only need scws
zhparser project when you only need a thin pg layer (you need install scws separately before using zhparser)
Jaimin Pan: [email protected]