Hands-Free & No Config: Test Drive Athena Now!
Athena Security
  • HOME
  • PRODUCTS
    • Athena Core
    • Athena NIDS
    • Athena AI Analyst (Pallas)
    • Athena XDR+ (Ageleia)
  • SERVICES
    • Athena SecOps
    • Athena MDR
    • Athena vCISO
  • ATHENA LABS
    • Areas of Research
    • Market Research
    • Publications
    • AthenaBench Results
  • ABOUT
    • About Athena
    • Leadership Team
    • Board of Directors
    • Advisory Board
    • Press Releases
    • FAQs
  • BLOG
  • CONTACT
  • Click to open the search input field Click to open the search input field Search
  • Menu Menu

Publications

Explore Athena’s published technical papers, articles, and AI-driven security insights from Athena Labs to stay informed on cybersecurity innovation.

Word Embeddings and Semantic Spaces in Natural Language Processing

International Journal of Intelligence Science, 13, 1-21. Worth, P. J. (2023)
Abstract
One of the critical hurdles, and breakthroughs, in the field of Natural Language Processing (NLP) in the last two decades has been the development of techniques for text representation that solves the so-called curse of dimensionality, a problem which plagues NLP in general given that the feature set for learning starts as a function of the size of the language in question, upwards of hundreds of thousands of terms typically. As such, much of the research and development in NLP in the last two decades has been in finding and optimizing solutions to this problem, to feature selection in NLP effectively. This paper looks at the development of these various techniques, leveraging a variety of statistical methods which rest on linguistic theories that were advanced in the middle of the last century, namely the distributional hypothesis which suggests that words that are found in similar contexts generally have similar meanings. In this survey paper we look at the development of some of the most popular of these techniques from a mathematical as well as data structure perspective, from Latent Semantic Analysis to Vector Space Models to their more modern variants which are typically referred to as word embeddings. In this review of algoriths such as Word2Vec, GloVe, ELMo and BERT, we explore the idea of semantic spaces more generally beyond applicability to NLP.

KEYWORDS
Natural Language Processing, Vector Space Models, Semantic Spaces, Word Embeddings, Representation Learning, Text Vectorization, Machine Learning, Deep Learning

Layer Cake: On Language Representation and Compute Characteristics in Text Classification

Peter J. Worth Jr.*, Ionut Cardei

Abstract

Since transformer-based language models were introduced in 2017, they have been shown to be extraordinarily effective across a variety of NLP tasks including but not limited to language generation. The introduction and widespread adoption of these LLMs, which encode extremely high-dimensional semantic spaces, comes at a significant cost in terms of system and computational resource requirements, requirements that have reshaped the entire chip (GPU) and data center industry as hardware, cloud, and infrastructure providers try to keep up with the demand. This has motivated the research community to develop a variety of design strategies that optimize the use of these resources; however, computational requirements continue to grow in proportion to model size and complexity. With this study, we introduce a framework called Layer Cake for the precise measurement of the relative computational resource requirements necessary for text classification using a variety of classifiers from across the Machine Learning (ML) and Deep Learning (DL) landscape, leveraging different language model families and focusing on the forms of language representation used in different test scenarios. We find that while LLMs do yield the best results across classifiers on average, these improvements come at a significant computational overhead. For example, from a Macro-F1 score perspective, LLM-based classifiers outperform their static embedding language model counterparts (Word2Vec, FastText & GloVe), even when encapsulated in DL architectures such as Convolutional Neural Networks or Long-Short-Term Networks by 8.87% on average, and perform 12.73% better than ML classifiers such as Support Vector Machines and Logistic Regression models. However, this uptick in model performance comes at a computational overhead cost of 4398.07% compared to the GPU requirements of static word embedding DL classifiers, and a 4126.02% increase in computation time relative to ML classifiers, the latter of which are CPU, rather than GPU, bound.

Keywords
Text Classification, Large Language Models, LLMs, Natural Language Processing, NLP, Language Representation, Word Embeddings, Vector Space Models, Semantic Spaces, Artificial Intelligence, AI, Computational Linguistics

AthenaBench: A Dynamic Benchmark for Evaluating LLMs in Cyber Threat Intelligence

Md Tanvirul Alam, Dipkamal Bhusal, Salman Ahmad, Nidhi Rastogi, Peter Worth

Abstract
Large Language Models (LLMs) have demonstrated strong capabilities in natural language reasoning, yet their application to Cyber Threat Intelligence (CTI) remains limited. CTI analysis involves distilling large volumes of unstructured reports into actionable knowledge, a process where LLMs could substantially reduce analyst workload. CTIBench introduced a comprehensive benchmark for evaluating LLMs across multiple CTI tasks. In this work, we extend CTIBench by developing AthenaBench, an enhanced benchmark that includes an improved dataset creation pipeline, duplicate removal, refined evaluation metrics, and a new task focused on risk mitigation strategies. We evaluate twelve LLMs, including state-of-the-art proprietary models such as GPT-5 and Gemini-2.5 Pro, alongside seven open-source models from the LLaMA and Qwen families. While proprietary LLMs achieve stronger results overall, their performance remains subpar on reasoning-intensive tasks, such as threat actor attribution and risk mitigation, with open-source models trailing even further behind. These findings highlight fundamental limitations in the reasoning capabilities of current LLMs and underscore the need for models explicitly tailored to CTI workflows and automation.

Subjects:
Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)

Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

Md Tanvirul Alam, Aritran Piplai, Ionut Cardei, Nidhi Rastogi, Peter J Worth Jr

Abstract

Cyber threat intelligence (CTI) analysts routinely convert noisy, unstructured security artifacts into standardized, automation-ready representations. Although large language models (LLMs) show promise for this task, existing approaches remain brittle when producing structured CTI outputs and have largely relied on supervised fine-tuning (SFT). In contrast, CTI standards and community-maintained resources define canonical identifiers and schemas that enable deterministic verification of model outputs. We leverage this structure to study reinforcement learning with verifiable rewards (RLVR) for CTI tasks. We introduce \textit{Minerva}, a unified dataset and training pipeline spanning multiple CTI subtasks, each paired with task-specific verifiers that score structured outputs and identifier predictions. To address reward sparsity during rollout, we propose a lightweight self-training mechanism that generates additional verified trajectories and distills them back into the model. Experiments across LLM backbones show consistent improvements in accuracy and robustness over SFT across multiple benchmarks.

Subjects:
Machine Learning (cs.LG)

Idealogical Reference Architecture (IRA): An Epistemological Interpretation of Quantum Mechanics1

Peter J. Worth1, Seetesh Pande2

Abstract

With this work, we introduce a system of idealogical metaphysics which is primarily born out of an epistemological interpretation of Quantum Mechanics (QM). Our interpretative stance follows a long line of consciousness (what we call mind) based on interpretations, or explanations, for the so-called measurement problem in QM, a position held by physicists (and mathematicians) such as Von Neumann, Wigner, Bohm, Stapp, Manousakis, Blaha, and Pradhan among others. Given this perspective, we conceive of the measurement problem to be a function of the boundary condition between mind and matter itself, a condition we wish to shed light on by abstracting the problem of measurement itself out of physics proper and (more directly) into the domain of philosophy explicitly using modern conceptions of epistemology and information theory, as well as quantum measurement theory, to construct a system of metaphysics, based upon knowledge and information processing and theory, that sheds light on the relationship between mind and matter generally. In this context, akin to Alan Turing’s work in theoretical computer science in 1950 which introduced the concept of a theoretical computing machine which ultimately provided the basis for modern computers, we introduce the notion of an idealogical computing machine, or IRA (Idealogical Reference Architecture), which is constructed based upon modern software development models and paradigms (object oriented programming and design primarily) which represent the de facto standard used by information processing systems in modern computing applications. We, however, take the additional conceptual abstraction from information to knowledge, after which IRA can be viewed within the broader philosophical dialogue, both in its Western (Plato, Aristotle, Descartes, Kant, Schopenhauer) as well as Eastern (Vedanta, Samkhya, Daoism) dialects. To this end, we hope this work can serve as a framework for comparison and further development as a sort of reference architecture, in philosophical, theological and (theoretical) scientific circles, to provide precision and clarity to metaphysical discussions in the same way the Turing Machine provided for a more precise definition of computer system design (and limits).

Subjects

Metaphysics, Theoretical Computer Science, Cognitive Science, Information Processing, Epistemology, Quantum Mechanics, Idealism, Plato, Kant, Daoism, Taoism, I Ching, Book of Changes, Samkhya, Yoga, Schopenhauer, Theoretical Physics

DEBRA: On the Unsupervised Learning of Concept Hierarchies from (Literary) Text

Peter J. Worth1, Domagoj Doresic2

Abstract

With this work, we introduce a novel method for the unsupervised learning of conceptual hierarchies, or concept maps as they are sometimes called, which is aimed specifically for use with literary texts, as such distinguishing itself from the majority of research literature on the topic which is primarily focused on building ontologies from a vast array of different types of data sources, both structured and unstructured, to support various forms of AI, in particular, the Semantic Web as envisioned by Tim Berners-Lee. We first elaborate on mutually informing disciplines of philosophy and computer science, or more specifically the relationship between metaphysics, epistemology, ontology, computing and AI, followed by a technically in-depth discussion of DEBRA, our dependency tree based concept hierarchy constructor, which as its name alludes to, constructs a conceptual map in the form of a directed graph which illustrates the concepts, their respective relations, and the implied ontological structure of the concepts as encoded in the text, decoded with standard Python NLP libraries such as spaCy and NLTK. With this work we hope to both augment the Knowledge Representation literature with opportunities for intellectual advancement in AI with more intuitive, less analytical, and well-known forms of knowledge representation from the cognitive science community, as well as open up new areas of research between Computer Science and the Humanities with respect to the application of the latest in NLP tools and techniques upon literature of cultural significance, shedding light on existing methods of computation with respect to documents in semantic space that effectively allows for, at the very least, the comparison and evolution of texts through time, using vector space math.

Subjects

Ontology Learning, Ontology Engineering, Concept Hierarchies, Concept Mapping, Concept Maps, Artificial Intelligence, Philosophy, Natural Language Processing, Knowledge Representation, Knowledge Representation and Reasoning, Machine Learning, Natural Language Processing, NLP, Computer Science, Theoretical Computer Science, Epistemology, Metaphysics, Philosophy, Logic, Computing, Ontology, First Order Logic, Predicate Calculus

Contact: labs@athenasecuritygrp.com

  • Areas of Research
  • Market Research
  • Publications
  • AthenaBench Results

Contact Us

Please enable JavaScript in your browser to complete this form.
Loading
© Copyright - Athena Software Group, Inc. 2026
  • Privacy Policy
  • Client Login
Scroll to top Scroll to top Scroll to top