From Networks to Named Entities and Back Again: Exploring Classical Arabic Isnad Networks
Other titles: | With Arabic characters in the text |
---|---|
Main authors: | ; ; |
Format: | Electronic resource, Article |
Language: | English |
Published: | Université du Luxembourg, 2023 |
In: | Journal of historical network research, Year: 2023, Volume: 8, Pages: 1-20 |
Other keywords: | Hadith; name disambiguation; Natural Language Processing; Network Analysis |
Online link: | Full text (free of charge) |
Summary: | This paper explores new methods for disambiguating the identity of individuals in classical Arabic citations (isnāds) using a network-based approach. After training a model to extract name mentions from classical Arabic, we embed these mentions in vector space using fine-tuned BERT representations and use community detection to infer clusters of coreferent mentions. The best-performing clustering approach reduces error on the CoNLL metric by 30%. Then, as a case study, we examine the problem of determining the number of direct transmitters to Ibn ʿAsākir (d. 1176) in a set of isnāds taken from the 12th-century historical text Taʾrīkh Madīnat Dimashq (TMD, History of Damascus), using our method to replicate human judgement. |
---|---|
ISSN: | 2535-8863 |
Related works: | Contained in: Journal of historical network research |
Persistent identifiers: | DOI: 10.25517/jhnr.v8i1.135 |
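
Illustrative note: the summary describes embedding name mentions as vectors and then grouping coreferent mentions by clustering a graph. The record does not say which community detection algorithm or similarity measure the paper uses, so the sketch below is only a rough stand-in under stated assumptions: it links mentions whose cosine similarity meets a hypothetical `threshold` and takes the connected components of that graph as clusters. The function names, the threshold value, and the toy vectors are all invented for illustration and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def cluster_mentions(vectors, threshold=0.9):
    """Group mention indices linked by cosine similarity >= threshold.

    Assumption: connected components of the thresholded similarity graph
    stand in for the (unspecified) community detection step.
    """
    n = len(vectors)
    parent = list(range(n))  # union-find forest over mention indices

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if cosine(vectors[i], vectors[j]) >= threshold:
                parent[find(j)] = find(i)  # merge the two components

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy stand-ins for mention embeddings (real BERT vectors are far larger).
toy_vectors = [
    [1.0, 0.0, 0.1],
    [0.9, 0.1, 0.1],
    [0.0, 1.0, 0.0],
    [0.1, 0.9, 0.0],
]
clusters = cluster_mentions(toy_vectors)
print(clusters)
```

With these toy vectors the first two mentions fall into one cluster and the last two into another. Lowering `threshold` merges more mentions, which is the kind of trade-off a coreference metric such as CoNLL would be used to evaluate.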