Bayesian STAPLE

An algorithm that merges the labelings of multiple raters to estimate a ground truth together with the performance parameters (sensitivity and specificity) of each rater.

Installation

pip install bstaple

Example of usage

import numpy as np 
from bstaple import BayesianSTAPLE

rater1 = [0,0,0,1,1,1,0,0,0,0,0]
rater2 = [0,0,0,0,1,1,1,0,0,0,0]
rater3 = [0,0,0,0,1,1,1,0,0,0,0]
D = np.stack([rater1, rater2, rater3], axis=-1)

bayesianSTAPLE = BayesianSTAPLE(D)
trace = bayesianSTAPLE.sample(draws=10000, burn_in=1000, chains=3)

Extract the estimated ground truth:

soft_ground_truth = bayesianSTAPLE.get_ground_truth(trace)
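
If binary labels are needed, the soft ground truth can be thresholded. The snippet below is a minimal sketch, assuming get_ground_truth returns an array of probabilities in [0, 1] with one value per labeled item. The stand-in values and the 0.5 cut-off are illustrative assumptions, not output of the library.

```python
import numpy as np

# Stand-in for the result of get_ground_truth (assumed to hold one probability
# per labeled item). These numbers are made up for illustration only.
soft_ground_truth = np.array(
    [0.01, 0.02, 0.03, 0.40, 0.98, 0.99, 0.85, 0.02, 0.01, 0.01, 0.01]
)

# Hypothetical cut-off: items with probability >= 0.5 are labeled 1.
threshold = 0.5
hard_ground_truth = (soft_ground_truth >= threshold).astype(int)
```

A different threshold trades false positives against false negatives, so the right value depends on the application.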

Plot the raters' sensitivities and specificities:

import arviz as az
ax = az.plot_forest(
    trace,
    var_names=["p", "q"],
    hdi_prob=0.95,
    combined=True,
)

Arguments

D: array of {0,1} elements
Raters' labels. The array must have shape
(dim_1, dim_2, ..., dim_N, raters).
The first N dimensions index the data labeled by the raters.
If repeated_labeling=True, the shape must be
(dim_1, dim_2, ..., dim_N, iterations, raters).
w: 'hierarchical', float in [0,1], or array of [0,1] elements, default='hierarchical'
Probability that an item of the ground truth has label 1.
If it is 'hierarchical', this probability is treated as a random variable and estimated during sampling.
If it is a value between 0 and 1, all items of the ground truth have the same probability.
If it is an array, each item of the ground truth has the probability given by the corresponding array element. In this case, the array must have shape (dim_1, dim_2, ..., dim_N).
repeated_labeling: boolean, default=False
Set to True if the raters have labeled the same input multiple times. In this case, the data must have shape (dim_1, dim_2, ..., dim_N, iterations, raters).
alpha_p: int or array of int, optional
Number of true positives.
beta_p: int or array of int, optional
Number of false positives.
alpha_q: int or array of int, optional
Number of true negatives.
beta_q: int or array of int, optional
Number of false negatives.
alpha_w: int or array of int, optional
Number of labels 1 that are expected to be in the ground truth.
beta_w: int or array of int, optional
Number of labels 0 that are expected to be in the ground truth.
seed: int or array of int, optional
Seed for the sampling algorithm.
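
The shape requirements above can be checked with NumPy alone before any sampling. The following is a small sketch that builds D for single labeling, for repeated labeling, and a per-item w array. The second labeling pass and the value 0.3 are hypothetical, chosen only to illustrate the shapes.

```python
import numpy as np

rater1 = [0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0]
rater2 = [0, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0]
rater3 = [0, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0]

# Single labeling: raters go on the last axis -> shape (dim_1, raters).
D = np.stack([rater1, rater2, rater3], axis=-1)

# Repeated labeling: a hypothetical second pass by the same three raters.
second_pass = np.stack([rater2, rater2, rater3], axis=-1)

# Iterations go on the second-to-last axis -> shape (dim_1, iterations, raters).
D_repeated = np.stack([D, second_pass], axis=-2)

# Per-item w: same shape as the data without the raters axis.
# The value 0.3 is an arbitrary illustrative probability.
w = np.full(D.shape[:-1], 0.3)

print(D.shape, D_repeated.shape, w.shape)
```

With these shapes, D_repeated would be passed together with repeated_labeling=True, and w would be passed in place of the default 'hierarchical'.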
