ID | 68936 |
FullText URL | |
Author |
Fujii, Shota
Graduate School of Natural Science and Technology, Okayama University
Kawaguchi, Nobutaka
Research & Development Group, Hitachi, Ltd.
Shigemoto, Tomohiro
Research & Development Group, Hitachi, Ltd.
Yamauchi, Toshihiro
Faculty of Natural Science and Technology, Okayama University
ORCID
Kaken ID
publons
researchmap
|
Abstract | Cybersecurity threats have been increasing and growing more sophisticated year by year. In such circumstances, gathering Cyber Threat Intelligence (CTI) and following up with up-to-date threat information is crucial. Structured CTI such as Structured Threat Information eXpression (STIX) is particularly useful because it can automate security operations such as updating FW/IDS rules and analyzing attack trends. However, as most CTIs are written in natural language, manual analysis with domain knowledge is required, which becomes quite time-consuming.
In this work, we propose CyNER, a method for automatically structuring CTIs and converting them into STIX format. CyNER extracts named entities in the context of CTI and then extracts the relations between named entities and IOCs in order to convert them into STIX. In addition, by using key phrase extraction, CyNER can extract relations between IOCs that lack contextual information, such as those listed at the bottom of a CTI, and named entities. We describe our design and implementation of CyNER and demonstrate that it can extract named entities with the F-measure of 0.80 and extract relations between named entities and IOCs with the maximum accuracy of 81.6%. Our analysis of structured CTI showed that CyNER can extract IOCs that are not included in existing reputation sites, and that it can automatically extract IOCs that have been exploited for a long time and across multiple attack groups. CyNER is thus expected to contribute to the efficiency of CTI analysis. |
Note | This is an Accepted Manuscript of a Conference paper published by Springer International Publishing.
IWSEC 2022
Lecture Notes in Computer Science, volume 13504
|
Published Date | 2022-08-12
|
Publication Title |
Advances in Information and Computer Security
|
Publisher | Springer International Publishing
|
Start Page | 85
|
End Page | 104
|
ISSN | 0302-9743
|
Content Type |
Book
|
language |
English
|
OAI-PMH Set |
岡山大学
|
Copyright Holders | © 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
|
File Version | author
|
DOI | |
Web of Science KeyUT | |
Related Url | isVersionOf https://doi.org/10.1007/978-3-031-15255-9_5
|
Citation | Fujii, S., Kawaguchi, N., Shigemoto, T., Yamauchi, T. (2022). CyNER: Information Extraction from Unstructured Text of CTI Sources with Noncontextual IOCs. In: Cheng, CM., Akiyama, M. (eds) Advances in Information and Computer Security. IWSEC 2022. Lecture Notes in Computer Science, vol 13504. Springer, Cham. https://doi.org/10.1007/978-3-031-15255-9_5
|