88 Commits (d650b62a4015f5f1f8b4011fffafb4af579925d8)

Author SHA1 Message Date
matthieugomez d650b62a40 clean tests 2020-11-14 12:37:04 -08:00
matthieugomez f9675fd110 update 2020-11-14 11:40:44 -08:00
matthieugomez 730a513d8e redefine modifiers 2020-11-11 21:13:14 -08:00
matthieugomez bd9c7fba24 use pre-preprocessing in find* functions 2020-11-10 14:20:32 -08:00
Robert Feldt ed6c2f650f Add NMD and fix bug in MorisitaOverlap (#40) 2020-11-10 10:55:05 -08:00
matthieugomez f4185fbfe0 add test 2020-11-10 07:29:48 -08:00
Robert Feldt 0c57f62319 Adds MorisitaOverlap distance (#39) 2020-11-10 07:12:28 -08:00
matthieugomez e4095682b4 add Hamming + restrict pairwise to vectors + handle missings 2020-11-09 19:04:35 -08:00
matthieugomez 3040be7ced export pairwise! 2020-11-08 17:05:14 -08:00
Robert Feldt a0e5347d8c pairwise for calculating distance matrices (#38) 2020-11-08 16:16:28 -08:00
matthieugomez c7728160bf correct 2020-11-07 12:48:53 -08:00
Robert Feldt 11ec55799a fix tests for short strings (shorter than q in QGramDistances) (#37) 2020-10-25 12:46:25 -07:00
Robert Feldt cacbbc5487 name change to QGramSortedVector, code and tests for handling unicode strings 2020-10-24 14:32:19 +02:00
Robert Feldt c0bedf89a6 faster calculation of QGramDistances if precounting qgrams either as Dict or sorted array, with tests 2020-10-23 23:33:08 +02:00
matthieugomez b2399a0ef7 up 2020-09-28 14:57:42 -07:00
matthieugomez e6898f5274 findclosest 2020-09-28 14:55:18 -07:00
matthieugomez 46ae721329 findmax -> findbest 2020-09-28 14:47:23 -07:00
matthieugomez 04b1902f9e add back normalize for Partial/TokenSort/TokenSet 2020-07-20 08:25:53 -07:00
matthieugomez e0ef0e8ec1 correct normalize Partial/TokenSort/TokenSet 2020-07-20 07:08:27 -07:00
matthieugomez 4df4bad6af solve https://github.com/matthieugomez/StringDistances.jl/issues/30 2020-07-13 10:40:30 -07:00
matthieugomez 49b1f3b439 slower but simpler iteration 2020-02-18 08:18:45 -05:00
matthieugomez 093c536377 simplify three arguments form 2020-02-13 09:44:27 -05:00
matthieugomez ce24604a63 rmv inferred test for 1.0 2020-02-12 09:43:48 -05:00
matthieugomez 4806349088 add functors 2020-02-12 09:41:46 -05:00
matthieugomez 29c2b6aeca allo any iterator in Jaro + add tests 2020-02-11 07:39:15 -05:00
matthieugomez 7be2b6171f Update Project.toml 2020-02-09 14:10:27 -05:00
matthieugomez 5cbbfc5bde allow any iterator in. Define evaluate for modifiers. 2020-02-09 14:10:27 -05:00
matthieugomez e1b8aa6500 allow any iterator in qgram distances 2020-02-08 11:38:06 -05:00
matthieugomez 7eed427f9c allow any iterable 2020-02-07 08:36:15 -05:00
matthieugomez a575eeab6a rmv datastructures + add docs 2019-12-13 10:59:09 -05:00
matthieugomez 164448f5d7 allow skipmissing iterator 2019-12-12 17:00:50 -05:00
matthieugomez 82d5f3bc91 remove Hamming, create StringDistance 2019-12-12 15:11:32 -05:00
matthieugomez 538c37991b return indices 2019-12-12 14:26:25 -05:00
matthieugomez 16cf5abb94 improve support for missings 2019-12-12 10:57:26 -05:00
Dillon Daudert 55221b5794 result_type for str metrics; fix type instability in RatcliffObershelp 2019-12-12 10:42:32 -05:00
matthieugomez 6d38c066ad Update modifiers.jl 2019-08-20 13:44:49 -04:00
matthieugomez 318ede6665 update 2019-08-20 13:21:31 -04:00
matthieugomez 1196915726 extract 2019-08-20 12:43:21 -04:00
matthieugomez 5aa033bf05 add extract + handle Missing 2019-08-20 12:32:52 -04:00
matthieugomez 5ea65c389a Only keep compare for Levenshtein and Damerau 2019-08-20 11:59:23 -04:00
matthieugomez 7edca83311 refinement 2019-08-19 13:54:38 -04:00
matthieugomez 6aef47bc89 add maximum distance for Jaro, Levenshtein, DamerauLevenshtein 2019-08-19 13:12:55 -04:00
matthieugomez dd09371297 update 2019-08-18 13:13:04 -04:00
matthieugomez 9e73346c07 add 2019-08-17 16:12:41 -04:00
matthieugomez 06d8ef1831 add tests 2019-08-17 15:46:22 -04:00
matthieugomez fc3fc17992 add tests 2019-08-17 14:38:49 -04:00
matthieugomez 402d24997f change compare 2019-08-17 12:26:24 -04:00
matthieugomez cf4f92cbf5 Update utf8.jl 2019-08-17 11:58:48 -04:00
matthieugomez b9bf5ac553 Update modifiers.jl 2019-08-14 10:45:16 -04:00
matthieugomez fc6a85fb32 Update utf8.jl 2019-08-14 10:30:36 -04:00