Commit Graph

89 Commits (cacbbc54870e3749424d41bf252c55ceaa8c7491)

Author SHA1 Message Date
Robert Feldt cacbbc5487 name change to QGramSortedVector, code and tests for handling unicode strings 2020-10-24 14:32:19 +02:00
Robert Feldt c0bedf89a6 faster calculation of QGramDistances if precounting qgrams either as Dict or sorted array, with tests 2020-10-23 23:33:08 +02:00
matthieugomez ac783773ba findmax -> findnearest 2020-10-03 09:42:09 -07:00
matthieugomez b2399a0ef7 up 2020-09-28 14:57:42 -07:00
matthieugomez e6898f5274 findclosest 2020-09-28 14:55:18 -07:00
matthieugomez 46ae721329 findmax -> findbest 2020-09-28 14:47:23 -07:00
matthieugomez 04b1902f9e add back normalize for Partial/TokenSort/TokenSet 2020-07-20 08:25:53 -07:00
matthieugomez fb0a786fd9 return 1 if distance over maxdist 2020-07-19 12:37:49 -07:00
matthieugomez 8c2226bf4b do not normalize Partial/TokenSet/TokenSort by default 2020-07-13 11:39:21 -07:00
matthieugomez f144292b70 cleanups 2020-02-13 09:48:35 -05:00
matthieugomez 093c536377 simplify three arguments form 2020-02-13 09:44:27 -05:00
matthieugomez 4806349088 add functors 2020-02-12 09:41:46 -05:00
matthieugomez 067a7c58d2 cleanups 2020-02-11 06:58:15 -05:00
matthieugomez 7be2b6171f Update Project.toml 2020-02-09 14:10:27 -05:00
matthieugomez 5cbbfc5bde allow any iterator in. Define evaluate for modifiers. 2020-02-09 14:10:27 -05:00
matthieugomez f4c2c9acf1 type 2020-02-08 12:03:02 -05:00
matthieugomez 30fb47e9bf update 2020-02-08 12:00:44 -05:00
matthieugomez 154f1465fd StringDistance is now just an union 2020-02-08 11:49:53 -05:00
matthieugomez e1b8aa6500 allow any iterator in qgram distances 2020-02-08 11:38:06 -05:00
matthieugomez 7eed427f9c allow any iterable 2020-02-07 08:36:15 -05:00
matthieugomez 9f30c134cf allow more than strings 2020-02-07 08:31:00 -05:00
matthieugomez 3b9493f8a9 correct partialsort 2019-12-18 10:17:08 -05:00
matthieugomez a575eeab6a rmv datastructures + add docs 2019-12-13 10:59:09 -05:00
matthieugomez 8be5a00e3d remove trie 2019-12-12 19:08:49 -05:00
matthieugomez 7d58273eae simplify missing 2019-12-12 15:40:39 -05:00
matthieugomez 82d5f3bc91 remove Hamming, create StringDistance 2019-12-12 15:11:32 -05:00
matthieugomez 16cf5abb94 improve support for missings 2019-12-12 10:57:26 -05:00
matthieugomez e0cc4f6bea update result_type 2019-12-12 10:42:32 -05:00
Dillon Daudert 55221b5794 result_type for str metrics; fix type instability in RatcliffObershelp 2019-12-12 10:42:32 -05:00
matthieugomez 318ede6665 update 2019-08-20 13:21:31 -04:00
matthieugomez 385461a61c Update StringDistances.jl 2019-08-20 12:46:53 -04:00
matthieugomez 5aa033bf05 add extract + handle Missing 2019-08-20 12:32:52 -04:00
matthieugomez f32d3ede9a remove IterTools dependency 2019-08-18 13:03:23 -04:00
matthieugomez 68702d8aa1 clean 2019-08-18 12:52:37 -04:00
matthieugomez 06d8ef1831 add tests 2019-08-17 15:46:22 -04:00
matthieugomez 9ea30c80c1 change 2019-08-17 13:14:55 -04:00
matthieugomez efcace4f03 simplify Radclikff 2019-08-17 12:57:35 -04:00
matthieugomez 402d24997f change compare 2019-08-17 12:26:24 -04:00
matthieugomez 9d33299d7b export qgram_iterator 2019-08-17 12:15:14 -04:00
matthieugomez 222a417612 correct qgram bigger than 1 2019-08-14 10:30:22 -04:00
matthieugomez a458b3356b rmv UInt8 2019-03-29 09:04:23 -04:00
matthieugomez e24e758289 encore dist.q into type + faster dict 2019-01-23 17:22:08 -05:00
matthieugomez 2834265e96 utf8 2018-11-21 14:15:28 -05:00
matthieugomez 0d505a18d9 test on 1.0 2018-08-19 00:44:10 +01:00
matthieugomez 5d535e3e06 binary 2018-07-05 10:32:10 -04:00
matthieugomez ba5a54fa84 update 0.7 2018-07-04 12:33:13 -04:00
matthieugomez f7537759b7 less exports 2018-05-17 11:41:50 -04:00
matthieugomez 1978f6f34b reorganize 2018-05-17 11:38:55 -04:00
matthieugomez c81c109548 remove grapheme 2017-08-05 15:07:48 -04:00
matthieugomez add2c6e7c4 simplify len + correct Jaro 2017-08-05 14:45:19 -04:00