Prism: an unsupervised clustering and nearest neighbor search algorithm
Clustering places similar vectors into the same cluster.
Nearest neighbor search finds the vectors most similar to a given query vector.
Prism does both of these things using artificial neural networks and decision trees.
The gradient descent learning algorithm is used to train an artificial neural network autoencoder.
The autoencoder is trained to maximize the variance of the output of the middle layer for the given training data.
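To make the training objective concrete, below is a minimal sketch in PyTorch. The layer sizes, tanh activation, optimizer, and hyperparameters are all assumptions, and since this section does not say whether a reconstruction term also enters the loss, the sketch optimizes the variance term alone.

import torch
import torch.nn as nn

class Autoencoder(nn.Module):
    def __init__(self, n_features, n_hidden=1):
        super().__init__()
        # Bottleneck ("middle layer") whose output variance is maximized.
        # tanh bounds the encoding, so maximizing its variance is well posed.
        self.encoder = nn.Sequential(nn.Linear(n_features, n_hidden), nn.Tanh())
        self.decoder = nn.Linear(n_hidden, n_features)

    def forward(self, x):
        z = self.encoder(x)
        return z, self.decoder(z)

def train_autoencoder(x, epochs=500, lr=1e-2):
    model = Autoencoder(x.shape[1])
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        z, _ = model(x)
        # Minimizing the negative variance of the middle-layer output
        # maximizes it, matching the objective described above.
        loss = -z.var(dim=0).sum()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return model

For the iris data set, x would be a (150, 4) float tensor of the four flower measurements.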
The top half of the autoencoder (the decoder) is removed, and a decision rule is learned on the remaining encoder's output using variance reduction.
The learned rule is used to split the training data into two sets, and the algorithm is applied recursively to each set to build a binary tree.
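Continuing the sketch above, the split and the recursion might look like the following. The threshold search, the min_size stopping rule, and the dict-based tree representation are illustrative assumptions; train_autoencoder is the function from the previous sketch.

import numpy as np
import torch  # train_autoencoder is defined in the sketch above

def best_split(z):
    # Choose the threshold that maximizes variance reduction, i.e. minimizes
    # the size-weighted variance of the two resulting halves.
    zs = np.sort(z)
    best_t, best_score = zs.mean(), np.inf
    for i in range(1, len(zs)):
        left, right = zs[:i], zs[i:]
        score = len(left) * left.var() + len(right) * right.var()
        if score < best_score:
            best_t, best_score = (zs[i - 1] + zs[i]) / 2.0, score
    return best_t

def build_tree(x, min_size=5):
    # Leaves of the binary tree are the clusters.
    if len(x) <= min_size:
        return {"leaf": x}
    xt = torch.as_tensor(x, dtype=torch.float32)
    model = train_autoencoder(xt)
    with torch.no_grad():
        z = model.encoder(xt).numpy().ravel()
    t = best_split(z)
    left, right = x[z <= t], x[z > t]
    if len(left) == 0 or len(right) == 0:
        return {"leaf": x}  # degenerate split; stop here
    return {"model": model, "threshold": t,
            "left": build_tree(left, min_size),
            "right": build_tree(right, min_size)}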
Results
The results below come from applying the algorithm to the iris data set.
The label entropy is computed for the learned clusters. The entropy is normalized between 0 and 1. Lower entropy is better.
The best entropy achieved by kmeans is 0.153569 (with Canberra distance), while the best achieved by Prism is 0.086526, roughly 1.8 times lower, on the iris data set.
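The exact normalization convention is not spelled out here; one common reading is the size-weighted average of per-cluster label entropy, divided by the log of the number of classes so the result lands in [0, 1]. A minimal sketch under that assumption:

import numpy as np

def normalized_label_entropy(labels, clusters):
    """Size-weighted average over clusters of the entropy of the true
    labels within each cluster, scaled by log(#classes) into [0, 1]."""
    labels, clusters = np.asarray(labels), np.asarray(clusters)
    n, n_classes = len(labels), len(np.unique(labels))
    total = 0.0
    for c in np.unique(clusters):
        member = labels[clusters == c]
        _, counts = np.unique(member, return_counts=True)
        p = counts / counts.sum()
        total += (len(member) / n) * -(p * np.log(p)).sum()
    return total / np.log(n_classes)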
Results for kmeans with different distance metrics
distance func               entropy
CanberraDistance            0.153569
EuclideanDistance           0.248515
ManhattanDistance           0.257049
SquaredEuclideanDistance    0.263581
BrayCurtisDistance          0.271857
ChebyshevDistance           0.441861
HammingDistance             1.000000
Results for a run of the algorithm with seed 1
mode             consistency    entropy
variance         12             0.086526
none             5              0.086526
parallel         6              0.086526
raw              6              0.238475
entropy          12             0.318924
orthogonality    10             0.349453
mixed            10             0.349453
Averaged results for 16 runs of the algorithm with different seeds