Abstract
A family of probability density functions (pdfs) is defined on the unit hypersphere Sn. The parameter space for the pdfs is G(d, n+ 1) × R≥ 0, for 1 ≤ d≤ n, where G(d, n+ 1) is the Grassmannian of d-dimensional linear subspaces in Rn+1 and R≥ 0 is the range of values for a concentration parameter. This family of pdfs generalises the Watson distribution on the sphere S2. It is shown that the pdfs are tractable, in that (i) a given pdf can be sampled efficiently, (ii) the parameters of a pdf can be estimated using maximum likelihood, and (iii) the Kullback–Leibler divergence and the Fisher–Rao metric on G(d, n+ 1) × R≥ 0 have simple forms. A wide range of shapes of the pdfs can be obtained by varying d and the concentration parameter. The pdfs are used to model clusters of feature vectors on the hypersphere. The clusters are compared using the Kullback–Leibler divergences of the associated pdfs. Experiments with the mnist, Human Activity Recognition and Gas Sensor Array Drift datasets show that good results can be obtained from clustering algorithms based on the Kullback–Leibler divergence, even if the dimension n of the hypersphere is high.
| Original language | English |
|---|---|
| Pages (from-to) | 302-322 |
| Number of pages | 21 |
| Journal | Journal of Mathematical Imaging and Vision |
| Volume | 65 |
| Issue number | 2 |
| DOIs | |
| State | Published - Apr 2023 |
| Externally published | Yes |
Keywords
- Classification
- Fisher–Rao metric
- Generalised Watson distribution
- Grassmannian
- Hypergeometric function
- Hypersphere
- Kullback–Leibler divergence
Fingerprint
Dive into the research topics of 'Generalised Watson Distribution on the Hypersphere with Applications to Clustering'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver