Package: comparator 0.1.2

comparator: Comparison Functions for Clustering and Record Linkage

Implements functions for comparing strings, sequences and numeric vectors for clustering and record linkage applications. Supported comparison functions include: generalized edit distances for comparing sequences/strings, Monge-Elkan similarity for fuzzy comparison of token sets, and L-p distances for comparing numeric vectors. Where possible, comparison functions are implemented in C/C++ to ensure good performance.

Authors: Neil Marchant [aut, cre]

comparator_0.1.2.tar.gz
comparator_0.1.2.zip (r-4.5) | comparator_0.1.2.zip (r-4.4) | comparator_0.1.2.zip (r-4.3)
comparator_0.1.2.tgz (r-4.4-x86_64) | comparator_0.1.2.tgz (r-4.4-arm64) | comparator_0.1.2.tgz (r-4.3-x86_64) | comparator_0.1.2.tgz (r-4.3-arm64)
comparator_0.1.2.tar.gz (r-4.5-noble) | comparator_0.1.2.tar.gz (r-4.4-noble)
comparator_0.1.2.tgz (r-4.4-emscripten) | comparator_0.1.2.tgz (r-4.3-emscripten)
comparator.pdf | comparator.html
comparator/json (API)
NEWS

# Install 'comparator' in R:
install.packages('comparator', repos = c('https://ngmarchant.r-universe.dev', 'https://cloud.r-project.org'))
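
Once installed, the comparison functions described above can be tried with a short session. The following is a minimal sketch, assuming the exported `Levenshtein()` constructor with its default unit edit costs and the `elementwise`, `pairwise` and `as.matrix` methods listed in the help manual; the input strings and variable names are illustrative only.

```r
library(comparator)

# Levenshtein comparator with default settings (assumed: unit-cost edit distance)
lev <- Levenshtein()

# Illustrative inputs, not taken from the package documentation
x <- c("kitten", "flaw")
y <- c("sitting", "lawn")

# Compare x[i] with y[i]; returns one value per pair
d_elem <- elementwise(lev, x, y)

# Compare every element of x with every element of y,
# then convert the resulting PairwiseMatrix to an ordinary matrix
d_pair <- as.matrix(pairwise(lev, x, y))

print(d_elem)
print(d_pair)
```

`elementwise` suits record pairs that are already aligned, while `pairwise` produces the full comparison matrix that clustering routines typically expect.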

Peer review:

Bug tracker: https://github.com/ngmarchant/comparator/issues

Uses libs:
  • c++ – GNU Standard C++ Library v3

On CRAN:

clustering, distance-measures, distance-metrics, entity-resolution, record-linkage, similarity-measures, string-similarity

24 exports | 16 stars | 1.97 score | 4 dependencies | 1 mentions | 47 scripts | 357 downloads

Last updated 3 years ago from: d41ddaa16a. Checks: OK: 1, NOTE: 8. Indexed: yes.

Target | Result | Date
Doc / Vignettes | OK | Sep 03 2024
R-4.5-win-x86_64 | NOTE | Sep 03 2024
R-4.5-linux-x86_64 | NOTE | Sep 03 2024
R-4.4-win-x86_64 | NOTE | Sep 03 2024
R-4.4-mac-x86_64 | NOTE | Sep 03 2024
R-4.4-mac-aarch64 | NOTE | Sep 03 2024
R-4.3-win-x86_64 | NOTE | Sep 03 2024
R-4.3-mac-x86_64 | NOTE | Sep 03 2024
R-4.3-mac-aarch64 | NOTE | Sep 03 2024

Exports: as.matrix, as.PairwiseMatrix, BinaryComp, Chebyshev, Constant, DamerauLevenshtein, elementwise, Euclidean, FuzzyTokenSet, gmean, Hamming, hmean, InVocabulary, Jaro, JaroWinkler, LCS, Levenshtein, Lookup, Manhattan, Minkowski, MongeElkan, OSA, pairwise, PairwiseMatrix

Dependencies: clue, cluster, proxy, Rcpp

Readme and manuals

Help Manual

Help page | Topics
Binary String/Sequence Comparator | BinaryComp
Chebyshev Numeric Comparator | Chebyshev
Virtual Comparator Class | Comparator-class
Constant String/Sequence Comparator | Constant
Virtual Class for a Sequence Comparator with a C++ Implementation | CppSeqComparator-class
Damerau-Levenshtein String/Sequence Comparator | DamerauLevenshtein
Elementwise Similarity/Distance Vector | elementwise elementwise,Chebyshev,matrix,matrix-method elementwise,CppSeqComparator,list,list-method elementwise,FuzzyTokenSet,list,list-method elementwise,InVocabulary,vector,vector-method elementwise,Lookup,vector,vector-method elementwise,MongeElkan,list,list-method elementwise,NumericComparator,matrix,vector-method elementwise,NumericComparator,vector,matrix-method elementwise,NumericComparator,vector,vector-method elementwise,StringComparator,vector,vector-method
Euclidean Numeric Comparator | Euclidean
Fuzzy Token Set Comparator | FuzzyTokenSet
Geometric Mean | gmean gmean.default
Hamming String/Sequence Comparator | Hamming
Harmonic Mean | hmean hmean.default
In-Vocabulary Comparator | InVocabulary
Jaro String/Sequence Comparator | Jaro
Jaro-Winkler String/Sequence Comparator | JaroWinkler
Longest Common Subsequence (LCS) Comparator | LCS
Levenshtein String/Sequence Comparator | Levenshtein
Lookup String Comparator | Lookup
Manhattan Numeric Comparator | Manhattan
Minkowski Numeric Comparator | Minkowski
Monge-Elkan Token Comparator | MongeElkan
Virtual Numeric Comparator Class | NumericComparator-class
Optimal String Alignment (OSA) String/Sequence Comparator | OSA
Pairwise Similarity/Distance Matrix | elementwise,Minkowski,matrix,matrix-method pairwise pairwise,Chebyshev,matrix,matrix-method pairwise,Chebyshev,matrix,NULL-method pairwise,Comparator,ANY,missing-method pairwise,CppSeqComparator,list,list-method pairwise,CppSeqComparator,list,NULL-method pairwise,FuzzyTokenSet,list,list-method pairwise,FuzzyTokenSet,vector,NULL-method pairwise,InVocabulary,vector,NULL-method pairwise,InVocabulary,vector,vector-method pairwise,Lookup,vector,NULL-method pairwise,Lookup,vector,vector-method pairwise,Minkowski,matrix,matrix-method pairwise,Minkowski,matrix,NULL-method pairwise,MongeElkan,list,list-method pairwise,MongeElkan,list,NULL-method pairwise,NumericComparator,matrix,vector-method pairwise,NumericComparator,vector,matrix-method pairwise,StringComparator,vector,NULL-method pairwise,StringComparator,vector,vector-method
Pairwise Similarity/Distance Matrix | as.matrix,PairwiseMatrix-method as.PairwiseMatrix as.PairwiseMatrix,matrix-method PairwiseMatrix PairwiseMatrix-class
Virtual Sequence Comparator Class | SequenceComparator-class
Virtual String Comparator Class | StringComparator-class
Virtual Token Comparator Class | TokenComparator-class