From Networks to Named Entities and Back Again: Exploring Classical Arabic Isnad Networks

This paper explores new methods for disambiguating the identity of individuals in classical Arabic citations (isnāds) using a network-based approach. After training a model to extract name mentions from classical Arabic, we embed these mentions in vector space using fine-tuned BERT representations a...

Полное описание

Сохранить в:  
Библиографические подробности
Другие заглавия:Mit arabischen Schriftzeichen im Text
Главные авторы: Muther, Ryan (Автор) ; Smith, David (Автор) ; Savant, Sarah Bowen (Автор)
Формат: Электронный ресурс Статья
Язык:Английский
Проверить наличие: HBZ Gateway
Journals Online & Print:
Загрузка...
Fernleihe:Fernleihe für die Fachinformationsdienste
Опубликовано: Université du Luxembourg 2023
В: Journal of historical network research
Год: 2023, Том: 8, Страницы: 1-20
Другие ключевые слова:B Hadith
B name disambiguation
B Natural Language Processing
B Network Analysis
Online-ссылка: Volltext (kostenfrei)
Volltext (kostenfrei)
Описание
Итог:This paper explores new methods for disambiguating the identity of individuals in classical Arabic citations (isnāds) using a network-based approach. After training a model to extract name mentions from classical Arabic, we embed these mentions in vector space using fine-tuned BERT representations and use community detection to infer clusters of coreferent mentions. The best-performing clustering approach reduces error on the CoNLL metric by 30%. Then, as a case study, we examine the problem of determining the number of direct transmitters to Ibn ʿAsākir (d. 1176) in a set of isnāds taken from the 12th century historical text Taʾrīkh Madīnat Dimashq (TMD, History of Damascus), using our method to replicate human judgement.
ISSN:2535-8863
Второстепенные работы:Enthalten in: Journal of historical network research
Persistent identifiers:DOI: 10.25517/jhnr.v8i1.135