Clusteval logo ClustEval clustering evaluation framework
Navigation:
027Hints:

Here you see general information about the dataset.

Data Set 'brown' - General

brown

The data set contains pairwise similarities of blasted sequences of 232 proteins belonging to the amidohydrolase superfamily. A gold standard is provided describing families within the given superfamily. According to the gold standard the amidrohydrolase superfamily contains 29 families.

Publication: Shoshana D Brown, John A Gerlt, Jennifer L Seffernick, and Patricia C Babbitt. A gold standard set of mechanistically diverse enzyme superfamilies. Genome Biol, 7(1):R8, 2006.(Link)


Information Value
Aliasbrown
Namesfld/sfld_brown_et_al_amidohydrolases_protein_similarities_for_beh.txt
Description:The data set contains pairwise similarities of blasted sequences of 232 proteins belonging to the amidohydrolase superfamily. A gold standard is provided describing families within the given superfamily. According to the gold standard the amidrohydrolase superfamily contains 29 families.
Publication:Shoshana D Brown, John A Gerlt, Jennifer L Seffernick, and Patricia C Babbitt. A gold standard set of mechanistically diverse enzyme superfamilies. Genome Biol, 7(1):R8, 2006.
Dataset formatRowwise Similarity
DownloadClick (by downloading the dataset you accept the license presented below)
License
First 10 lines