LLMs4OL

** LLMs4OL Paradigm Task A: Term Typing Task B: Type Taxonomy Discovery Task C: Type Non-Taxonomic Relation Extraction Finetuning Task A Detailed Results Task B Detailed Results Task C Detailed Results Task A Datasets Task B Datasets Task C Datasets Finetuning Datasets **

Task B. Type Taxonomy Discovery Results

Detailed results on test sets.

Dataset Model $t_1$ $t_2$ $t_3$ $t_4$ $t_5$ $t_6$ $t_7$ $t_8$
GeoNames BERT-Large
PubMedBERT
BART-Large
Flan-T5-Large
BLOOM-1b7
Flan-T5-XL
BLOOM-3b
LLaMA-7B
GPT-3
GPT-3.5
GPT-4
Flan-T5-Large*
Flan-T5-XL*
41.005
-
38.114
59.635
33.169
49.372
35.856
33.496
43.431
59.405
38.561
42.539
48.414
51.698
-
41.033
48.246
31.049
44.057
39.120
33.496
51.742
47.792
52.465
59.403
34.803
40.557
-
40.552
54.082
33.169
45.098
53.922
33.496
42.700
67.782
34.004
40.299
55.231
48.703
-
52.500
48.241
32.839
52.413
30.227
33.496
53.202
41.951
38.897
62.466
46.964
37.165
-
39.094
44.404
33.775
43.929
35.627
33.496
46.040
48.026
44.069
46.034
57.484
41.070
-
45.801
51.309
33.530
46.348
33.606
33.496
52.566
51.728
55.433
57.415
36.293
41.707
-
36.671
36.407
36.674
49.982
48.263
33.496
45.496
45.257
33.782
42.496
59.057
54.547
-
55.400
38.449
32.922
44.298
37.731
33.496
52.626
43.860
36.234
62.045
49.261
UMLS BERT-Large
PubMedBERT
BART-Large
Flan-T5-Large
BLOOM-1b7
Flan-T5-XL
BLOOM-3b
LLaMA-7B
GPT-3
GPT-3.5
GPT-4
Flan-T5-Large*
Flan-T5-XL*
48.215
33.713
36.029
47.558
33.713
64.256
33.169
32.948
51.584
61.380
41.195
37.176
63.693
38.842
33.713
48.218
51.221
36.188
46.533
37.233
32.948
49.412
70.385
76.999
48.667
50.046
41.467
33.713
41.429
55.320
33.713
51.006
34.823
32.948
49.865
63.915
42.558
36.074
36.917
40.412
33.713
49.907
40.947
38.262
41.549
35.777
32.948
42.901
66.821
63.889
42.121
41.343
45.889
33.713
39.372
49.455
33.713
60.077
33.169
32.948
50.573
63.144
50.288
48.396
78.127
40.911
33.713
47.479
50.873
35.895
42.831
35.895
32.948
46.070
67.271
78.116
46.654
50.122
41.041
33.713
42.398
44.232
33.278
51.257
33.059
32.948
45.367
56.648
36.594
53.428
79.255
42.922
33.713
45.464
42.909
33.605
41.186
37.483
32.948
46.728
64.412
60.728
35.970
39.274
schema.org BERT-Large
PubMedBERT
BART-Large
Flan-T5-Large
BLOOM-1b7
Flan-T5-XL
BLOOM-3b
LLaMA-7B
GPT-3
GPT-3.5
GPT-4
Flan-T5-Large*
Flan-T5-XL*
43.851
-
34.628
46.983
33.395
42.708
41.643
33.374
49.646
56.843
58.479
35.358
91.063
41.172
-
38.693
49.924
47.833
33.455
47.169
33.374
49.289
74.385
72.827
85.436
57.469
44.067
-
39.281
46.118
33.395
33.591
47.980
33.374
50.977
58.525
65.831
29.824
74.688
43.200
-
52.909
54.788
39.777
42.766
45.255
33.374
48.031
70.164
63.306
89.248
65.329
43.703
-
38.203
40.277
38.925
36.694
39.733
33.374
47.191
53.359
50.565
41.305
91.541
40.054
-
41.170
54.479
48.568
34.041
40.758
33.374
48.632
72.354
74.247
91.681
50.635
42.151
-
43.261
42.060
44.357
33.751
51.280
33.374
48.878
54.165
57.452
42.461
91.709
43.720
-
42.744
47.930
39.578
36.456
48.736
33.374
49.489
71.030
63.694
56.395
33.333