Commit Graph

231 Commits (7b9568d028c80e5d0452302134da4b3b385e7ae9)

Author SHA1 Message Date
matthieugomez f270e18310 simplify pairwise 2020-12-30 14:39:18 +01:00
matthieugomez 15175976a2 Update modifiers.jl 2020-12-30 14:24:24 +01:00
matthieugomez f9675fd110 update 2020-11-14 11:40:44 -08:00
matthieugomez 1cc89f0827 add more docs 2020-11-12 09:24:34 -08:00
matthieugomez 730a513d8e redefine modifiers 2020-11-11 21:13:14 -08:00
matthieugomez bd9c7fba24 use pre-preprocessing in find* functions 2020-11-10 14:20:32 -08:00
matthieugomez 4e2ab1b062 NormalizedMultisetDistance -> NMD 2020-11-10 13:52:31 -08:00
Robert Feldt ed6c2f650f Add NMD and fix bug in MorisitaOverlap (#40) 2020-11-10 10:55:05 -08:00
matthieugomez f4185fbfe0 add test 2020-11-10 07:29:48 -08:00
matthieugomez d20650bb9e add readme about Hamming and pairwise 2020-11-10 07:14:13 -08:00
Robert Feldt 0c57f62319 Adds MorisitaOverlap distance (#39) 2020-11-10 07:12:28 -08:00
matthieugomez e4095682b4 add Hamming + restrict pairwise to vectors + handle missings 2020-11-09 19:04:35 -08:00
matthieugomez b407b186f0 Update pairwise.jl 2020-11-08 17:33:17 -08:00
matthieugomez 8c2811b564 Update pairwise.jl 2020-11-08 17:26:04 -08:00
matthieugomez 55abbc7702 Update pairwise.jl 2020-11-08 17:24:17 -08:00
matthieugomez e1aafafe11 Update pairwise.jl 2020-11-08 17:13:18 -08:00
matthieugomez 3040be7ced export pairwise! 2020-11-08 17:05:14 -08:00
Robert Feldt a0e5347d8c pairwise for calculating distance matrices (#38) 2020-11-08 16:16:28 -08:00
matthieugomez c7728160bf correct 2020-11-07 12:48:53 -08:00
matthieugomez a53c7a9d2f add max_dist as part field for Levenshtein 2020-11-07 11:46:47 -08:00
Robert Feldt 9d28c36ed5 added doc strings and upped the dependency and CI to Julia 1.3 2020-10-24 21:01:39 +02:00
Robert Feldt cacbbc5487 name change to QGramSortedVector, code and tests for handling unicode strings 2020-10-24 14:32:19 +02:00
Robert Feldt c0bedf89a6 faster calculation of QGramDistances if precounting qgrams either as Dict or sorted array, with tests 2020-10-23 23:33:08 +02:00
matthieugomez ac783773ba findmax -> findnearest 2020-10-03 09:42:09 -07:00
matthieugomez b2399a0ef7 up 2020-09-28 14:57:42 -07:00
matthieugomez e6898f5274 findclosest 2020-09-28 14:55:18 -07:00
matthieugomez 46ae721329 findmax -> findbest 2020-09-28 14:47:23 -07:00
matthieugomez 42ddeb32ba Update qgram.jl 2020-07-23 12:49:00 -07:00
matthieugomez b0bd5eb47e Update normalize.jl 2020-07-20 08:46:42 -07:00
matthieugomez f802f5fe1a Delete modifiers.jl 2020-07-20 08:39:50 -07:00
matthieugomez 006cb31f81 Update normalize.jl 2020-07-20 08:32:01 -07:00
matthieugomez 04b1902f9e add back normalize for Partial/TokenSort/TokenSet 2020-07-20 08:25:53 -07:00
matthieugomez e0ef0e8ec1 correct normalize Partial/TokenSort/TokenSet 2020-07-20 07:08:27 -07:00
matthieugomez b5a2a10adc Update normalize.jl 2020-07-19 12:38:54 -07:00
matthieugomez fb0a786fd9 return 1 if distance over maxdist 2020-07-19 12:37:49 -07:00
matthieugomez 100a0b65a9 Update normalize.jl 2020-07-13 11:40:51 -07:00
matthieugomez 8c2226bf4b do not normalize Partial/TokenSet/TokenSort by default 2020-07-13 11:39:21 -07:00
matthieugomez 09da838753 Update edit.jl 2020-07-13 10:44:45 -07:00
matthieugomez 4df4bad6af solve https://github.com/matthieugomez/StringDistances.jl/issues/30 2020-07-13 10:40:30 -07:00
matthieugomez 77afc0d4fb Update edit.jl 2020-07-13 08:56:34 -07:00
matthieugomez 26221a13ed Update edit.jl 2020-07-13 08:52:30 -07:00
matthieugomez 8468d4fddc Update find.jl 2020-04-20 14:27:03 -04:00
matthieugomez 388055a622 Update find.jl 2020-04-20 14:09:52 -04:00
matthieugomez c04cb32bdf Update find.jl 2020-04-20 14:08:29 -04:00
matthieugomez 3c0b8d2f60 voc 2020-02-25 19:40:14 -05:00
matthieugomez 6f22f2c9f5 clean 2020-02-24 09:41:38 -05:00
matthieugomez afafe93bf6 use enumerate 2020-02-21 10:16:52 -05:00
matthieugomez acded3bf2a Update edit.jl 2020-02-19 08:42:17 -05:00
matthieugomez 8965d2e9a3 Update edit.jl 2020-02-19 08:39:09 -05:00
matthieugomez 5e11cd19c9 Update edit.jl 2020-02-19 08:35:17 -05:00