Paper deep dive
Q-AGNN: Quantum-Enhanced Attentive Graph Neural Network for Intrusion Detection
Devashish Chaudhary, Sutharshan Rajasegarar, Shiva Raj Pokhrel
Intelligence
Status: succeeded | Model: google/gemini-3.1-flash-lite-preview | Prompt: intel-v1 | Confidence: 96%
Last extracted: 3/26/2026, 1:32:56 AM
Summary
Q-AGNN is a hybrid quantum-classical graph neural network designed for intrusion detection. It models network flows as nodes in a graph and uses parameterized quantum circuits (PQCs) to encode multi-hop neighborhood information into a high-dimensional latent space. The model employs an attention mechanism to weight these quantum-enhanced embeddings, effectively capturing complex, non-linear relational dependencies in network traffic while remaining robust under NISQ hardware conditions.
Entities (5)
Relation Signals (3)
Q-AGNN → appliedto → Intrusion Detection
confidence 99% · Q-AGNN, a Quantum-Enhanced Attentive Graph Neural Network for intrusion detection
Q-AGNN → uses → Parameterized Quantum Circuits
confidence 98% · Q-AGNN leverages parameterized quantum circuits (PQCs) to encode multi-hop neighborhood information
Q-AGNN → executedon → IBM Quantum Hardware
confidence 95% · we also executed the Q-AGNN framework on actual IBM quantum hardware
Cypher Suggestions (2)
Identify technologies used by a specific model · confidence 95% · unvalidated
MATCH (m:Model {name: 'Q-AGNN'})-[:USES]->(t:Technology) RETURN t.name
Find all models applied to intrusion detection · confidence 90% · unvalidated
MATCH (m:Model)-[:APPLIED_TO]->(a:Application {name: 'Intrusion Detection'}) RETURN m.name
Abstract
With the rapid growth of interconnected devices, accurately detecting malicious activities in network traffic has become increasingly challenging. Most existing deep learning-based intrusion detection systems treat network flows as independent instances, thereby failing to exploit the relational dependencies inherent in network communications. To address this limitation, we propose Q-AGNN, a Quantum-Enhanced Attentive Graph Neural Network for intrusion detection, where network flows are modeled as nodes and edges represent similarity relationships. Q-AGNN leverages parameterized quantum circuits (PQCs) to encode multi-hop neighborhood information into a high-dimensional latent space, inducing a bounded quantum feature map that implements a second-order polynomial graph filter in a quantum-induced Hilbert space. An attention mechanism is subsequently applied to adaptively weight the quantum-enhanced embeddings, allowing the model to focus on the most influential nodes contributing to anomalous behavior. Extensive experiments conducted on four benchmark intrusion detection datasets demonstrate that Q-AGNN achieves competitive or superior detection performance compared to state-of-the-art graph-based methods, while consistently maintaining low false positive rates under hardware-calibrated noise conditions. Moreover, we also executed the Q-AGNN framework on actual IBM quantum hardware to demonstrate the practical operability of the proposed pipeline under real NISQ conditions. These results highlight the effectiveness of integrating quantum-enhanced representations with attention mechanisms for graph-based intrusion detection and underscore the potential of hybrid quantum-classical learning frameworks in cybersecurity applications.
Tags
Links
- Source: https://arxiv.org/abs/2603.22365v1
- Canonical: https://arxiv.org/abs/2603.22365v1
Full Text
71,786 characters extracted from source content.
[1] These authors contributed equally to this work.
Q-AGNN: Quantum-Enhanced Attentive Graph Neural Network for Intrusion Detection
s224281473@deakin.edu.au · sutharshan.rajasegarar@deakin.edu.au · shiva.pokhrel@deakin.edu.au

Abstract

With the rapid growth of interconnected devices, accurately detecting malicious activities in network traffic has become increasingly challenging. Most existing deep learning-based intrusion detection systems treat network flows as independent instances, thereby failing to exploit the relational dependencies inherent in network communications. To address this limitation, we propose Q-AGNN, a Quantum-Enhanced Attentive Graph Neural Network for intrusion detection, where network flows are modeled as nodes and edges represent similarity relationships. Q-AGNN leverages parameterized quantum circuits (PQCs) to encode multi-hop neighborhood information into a high-dimensional latent space, inducing a bounded quantum feature map that implements a second-order polynomial graph filter in a quantum-induced Hilbert space. An attention mechanism is subsequently applied to adaptively weight the quantum-enhanced embeddings, allowing the model to focus on the most influential nodes contributing to anomalous behavior. Extensive experiments conducted on four benchmark intrusion detection datasets demonstrate that Q-AGNN achieves competitive or superior detection performance compared to state-of-the-art graph-based methods, while consistently maintaining low false positive rates under hardware-calibrated noise conditions. Moreover, we also executed the Q-AGNN framework on actual IBM quantum hardware to demonstrate the practical operability of the proposed pipeline under real NISQ conditions.
These results highlight the effectiveness of integrating quantum-enhanced representations with attention mechanisms for graph-based intrusion detection and underscore the potential of hybrid quantum-classical learning frameworks in cybersecurity applications.

Keywords: quantum machine learning, quantum computing, quantum graph neural network, intrusion detection, network security

1 Introduction

The rapid proliferation of interconnected devices, cloud services, and Internet-of-Things (IoT) infrastructures has significantly increased the complexity and scale of modern networks ([33]). While this connectivity enables efficient data exchange and intelligent services, it also expands the attack surface, making networks increasingly vulnerable to sophisticated cyber threats. Intrusion Detection Systems (IDS) play a crucial role in identifying malicious activities and safeguarding network integrity. However, the evolving nature of attacks and the high volume of network traffic pose substantial challenges to existing IDS solutions ([24]). Traditional machine learning and deep learning-based intrusion detection approaches typically model network traffic flows as independent samples, relying on handcrafted or learned feature representations ([29]). Although such methods have achieved promising performance, they often overlook the inherent relational dependencies among network entities, such as communication patterns, temporal correlations, and shared behavioral characteristics. Ignoring these dependencies limits the ability of IDS models to detect complex and coordinated attacks that manifest across multiple network flows ([37]). To address this limitation, recent studies have adopted graph-based learning techniques, where network traffic is represented as graphs and detection is performed using Graph Neural Networks (GNNs) ([20, 7, 4, 3, 16, 5]).
While these methods improve detection by aggregating neighborhood information, they remain constrained by the expressiveness of classical aggregation functions and often struggle to capture high-order, non-linear relationships in large-scale or noisy network environments. Moreover, deeper GNN architectures may suffer from issues such as over-smoothing and performance degradation ([22]). In parallel, quantum machine learning (QML) has emerged as a promising paradigm capable of encoding complex, high-dimensional data using quantum feature spaces ([10, 31]). Parameterized quantum circuits (PQCs) have been shown to provide expressive representations that may enhance learning performance in certain tasks. Nevertheless, the practical deployment of fully quantum models is hindered by current hardware limitations, including noise and restricted qubit counts, motivating the development of hybrid quantum-classical architectures that can operate effectively in the near-term noisy intermediate-scale quantum (NISQ) era ([1]).

Motivated by these observations, we propose Q-AGNN, a Quantum-Enhanced Attentive Graph Neural Network for intrusion detection. In Q-AGNN, network flows are modeled as nodes and edges encode similarity relationships. Multi-hop neighborhood features are first encoded using parameterized quantum circuits to capture complex correlations, after which an attention mechanism selectively emphasizes the most influential quantum-enhanced embeddings. The resulting node representations are then used for accurate intrusion classification. The main contributions of this work are summarized as follows:

• We introduce a novel hybrid quantum-classical graph-based intrusion detection framework that integrates PQCs with attention mechanisms.

• We design a quantum-enhanced neighborhood encoding strategy that captures multi-hop relational information in network graphs.
• We implement and analyze the impact of quantum noise and highlight the robustness of the proposed approach under realistic NISQ settings.

• We conduct extensive experiments on four benchmark intrusion detection datasets, demonstrating that Q-AGNN achieves competitive or superior performance compared to state-of-the-art GNN-based methods while minimizing false positive rates.

• We train and evaluate Q-AGNN directly on actual IBM quantum hardware using a feasible subset of real network flows, providing concrete experimental evidence of the practical viability of the proposed approach under realistic quantum noise. These results establish an important stepping stone toward deployment on fault-tolerant quantum systems.

To the best of our knowledge, this study represents one of the earliest demonstrations of a quantum-enhanced graph neural network with attention mechanisms applied to intrusion detection in noisy quantum hardware environments.

2 Preliminaries and Related Work

This section reviews relevant work at the intersection of graph-based intrusion detection and quantum machine learning, as summarized in Table 1, and positions the proposed Q-AGNN architecture within this landscape. GNNs have emerged as a dominant paradigm for modeling relational structure in network traffic, consistently outperforming tabular and sequence-based methods in intrusion detection tasks. Early work demonstrated that explicitly encoding network topology enables more accurate detection of coordinated and distributed attacks. [6] proposed the Spatial-Temporal Graph Convolutional Network (ST-GCN), which models SDN switches as graph nodes and captures both structural and temporal dependencies, achieving significant improvements in DDoS detection. Subsequent research has moved beyond static, topology-driven graphs toward feature-driven graph construction.
[11] modeled individual network flows as nodes connected via feature similarity, enabling improved detection of stealthy and evasive attacks that do not manifest through fixed IP or port-based relationships. In parallel, Lo et al. [20] proposed E-GraphSAGE, which incorporates edge attributes into neighborhood aggregation and demonstrates that relational traffic features play a critical role in learning discriminative embeddings. Attention-based aggregation mechanisms further advanced this direction, as exemplified by the Multi-Graph GNN [4], which enables adaptive weighting of neighbors through a hybrid spectral-spatial architecture. In parallel, QML has gained attention as a means of capturing complex, non-linear patterns in network traffic. Prior studies have explored quantum autoencoders, quantum support vector classifiers, and hybrid quantum-classical generative models for anomaly and intrusion detection, demonstrating advantages in expressivity and robustness under certain conditions ([10, 19, 28]). Frameworks such as QuantumNetSec ([2]) further showed that carefully designed variational quantum circuits can achieve competitive performance on noisy intermediate-scale quantum (NISQ) devices. More recently, efforts have begun to combine graph learning with quantum computation. Quantum graph neural networks extend classical message passing into the quantum domain ([35, 36, 9, 15]), while QEGraphSAGE ([12]) integrates variational quantum circuits into edge-aware graph learning for IoT traffic analysis. Closest to this work, [25] proposed the Quantum Graph Attention Network (QGAT), which replaces classical attention with variational quantum circuits to generate multiple attention coefficients via quantum parallelism. Building on these advances, we propose Q-AGNN, a hybrid quantum-classical graph architecture that uniquely combines quantum-enhanced node feature encoding with multi-hop attention-based neighborhood aggregation for flow-level intrusion detection.
Unlike prior quantum GNNs, Q-AGNN explicitly targets realistic intrusion detection settings and is evaluated under practical quantum noise constraints.

Table 1: Comparison of representative graph-based and quantum-enhanced intrusion detection approaches. (Attn: Attention; Q-Enc: Quantum Encoding.)

Method          | Graph      | Attn      | Q-Enc | Hybrid | Noise   | Task
ST-GCN          | Topology   | No        | No    | No     | N/A     | DDoS
E-GraphSAGE     | Feature    | No        | No    | No     | N/A     | IDS
Multi-Graph GNN | Multi-view | Yes       | No    | No     | N/A     | IDS
QuantumNetSec   | None       | No        | Yes   | Yes    | Yes     | IDS
QEGraphSAGE     | Feature    | No        | Yes   | Yes    | Limited | IoT IDS
QGAT            | Feature    | Quantum   | Yes   | Yes    | Limited | Generic Graph
Q-AGNN (Ours)   | Feature    | Multi-hop | Yes   | Yes    | Yes     | IDS

3 Proposed Methodology

In the standard formulation, Graph Convolutional Networks (GCNs) ([17]) assume that all neighboring nodes contribute equally to the representation of a target node. The layer-wise propagation rule of a GCN is

H^{(l+1)} = \sigma( \hat{D}^{-1/2} \hat{A} \hat{D}^{-1/2} H^{(l)} W^{(l)} ),    (1)

where \hat{A} = A + I, with A \in \{0,1\}^{N \times N} being the adjacency matrix of the graph (A_{ij} = 1 if node i is connected to node j, and 0 otherwise) and I denoting the identity matrix, which adds self-loops to all nodes; \hat{D} is the corresponding degree matrix; H^{(l)} and H^{(l+1)} denote the node feature matrices at layers l and l+1, respectively; W^{(l)} is a trainable weight matrix; and \sigma(·) denotes a nonlinear activation function. This normalization-based aggregation treats all neighbors uniformly, which may be suboptimal in many real-world applications.

Figure 1: Overview of the proposed Q-AGNN architecture. Node features are encoded into quantum states via angle encoding and transformed using parameterized quantum circuits. Quantum-enhanced embeddings are aggregated using one-hop and two-hop attention mechanisms, followed by a classical MLP for intrusion detection.
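As a concrete illustration of the propagation rule in Eq. (1), the following is a minimal numpy sketch on a toy three-node graph (our own example, not the authors' implementation; the matrices and ReLU activation are illustrative choices):

```python
import numpy as np

# Toy sketch of one GCN propagation step, Eq. (1). Values are illustrative.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
A_hat = A + np.eye(3)                              # add self-loops: A^ = A + I
D_inv_sqrt = np.diag(1.0 / np.sqrt(A_hat.sum(axis=1)))
S = D_inv_sqrt @ A_hat @ D_inv_sqrt                # symmetric normalization

H = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])                         # node features H^(l)
W = np.array([[0.5, -0.5],
              [0.5,  0.5]])                        # trainable weights W^(l)
H_next = np.maximum(S @ H @ W, 0.0)                # sigma = ReLU
```

Because S is symmetric, every neighbor's contribution is fixed by the normalized degrees alone, which is exactly the uniform-weighting behavior the text criticizes.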
To address scalability and generalization limitations of GCNs, GraphSAGE was proposed as an inductive framework that learns aggregation functions over sampled neighborhoods rather than relying on the full graph structure ([14]). In GraphSAGE, the representation of node i is updated by first aggregating its neighbors' features using a predefined function and then combining this aggregated representation with the node's own features, expressed as

h_i' = \sigma( W [ h_i \,\|\, \mathrm{AGG}(\{ h_j : j \in \mathcal{N}(i) \}) ] ),    (2)

where h_i and h_i' denote the embedding of node i before and after aggregation, respectively; AGG(·) is a predefined aggregation function such as mean or pooling; and W is a trainable weight matrix. While GraphSAGE enables inductive learning on large or evolving graphs, its aggregation strategy treats all neighbors equally, which can dilute critical signals in security-sensitive applications such as IDS. In graph-based IDS, nodes may represent hosts or network flows, and edges represent communication relationships. Some neighbors correspond to benign traffic, while others may exhibit anomalous behavior such as abnormal packet rates, bursty communication, or irregular timing patterns. Fixed aggregation functions are unable to explicitly distinguish between such heterogeneous interactions. This limitation motivates the use of Graph Attention Networks (GATs) ([34]), which introduce a self-attention mechanism to adaptively weight neighbor contributions. In GATs, the updated feature representation of node i is computed as

h_i' = \sigma( \sum_{j \in \mathcal{N}(i)} \alpha_{ij} W h_j ),    (3)

where the attention coefficients \alpha_{ij} reflect the importance of neighbor j to node i and are obtained using a shared attention mechanism:

\alpha_{ij} = \frac{ \exp( \mathrm{LeakyReLU}( a^\top [ W h_i \,\|\, W h_j ] ) ) }{ \sum_{k \in \mathcal{N}(i)} \exp( \mathrm{LeakyReLU}( a^\top [ W h_i \,\|\, W h_k ] ) ) }.    (4)

By learning data-dependent attention weights, GATs selectively amplify security-critical neighborhood interactions while attenuating redundant or weakly informative connections, making them particularly effective for graph-based intrusion detection. Building on these classical GNN principles, we propose Q-AGNN, a hybrid quantum-classical graph architecture that integrates parameterized quantum circuits (PQCs) with explicit multi-hop attention-based aggregation to learn expressive node representations for intrusion detection. Fig. 1 presents an overview of the Q-AGNN framework, while Algorithm 1 details the end-to-end training and inference procedure. Each node feature vector x_i \in \mathbb{R}^F is first mapped to a quantum state using angle encoding:

|\psi_i\rangle = \bigotimes_{k=1}^{F} R_Y(x_{ik}) |0\rangle,    (5)

where R_Y(·) denotes a rotation about the Y axis parameterized by feature x_{ik}. The encoded state is then transformed by a parameterized EfficientSU2 ansatz with linear entangling topology. Each layer \ell = 1, …, L of the ansatz applies pre-entangling single-qubit rotations R_Y and R_Z on all qubits, followed by linear CNOT gates connecting qubit q to q+1, and then post-entangling rotations. One layer is expressed as

U_{\mathrm{EffSU2}}^{(\ell)} = \underbrace{\bigotimes_{q=1}^{F} R_Y(\theta_q^{(\ell,1)}) R_Z(\theta_q^{(\ell,2)})}_{\text{pre-entangling}} \;\; \underbrace{\prod_{q=1}^{F-1} \mathrm{CNOT}_{q,q+1}}_{\text{entanglement}} \;\; \underbrace{\bigotimes_{q=1}^{F} R_Y(\theta_q^{(\ell,3)}) R_Z(\theta_q^{(\ell,4)})}_{\text{post-entangling}}.    (6)

The final quantum state after L layers is

|\phi_i(\theta)\rangle = U_{\mathrm{EffSU2}}^{(L)} \cdots U_{\mathrm{EffSU2}}^{(1)} |\psi_i\rangle,    (7)

where \theta denotes all trainable parameters. A classical embedding is obtained via expectation values of observables:

z_i = \langle \phi_i(\theta) \,|\, \hat{O} \,|\, \phi_i(\theta) \rangle,    (8)

where z_i is the PQC-generated embedding of node i.
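The angle-encoding step of Eq. (5) can be sketched with a plain statevector simulation in numpy (our own illustration under the stated conventions, not the authors' code; `angle_encode` and the feature values are ours):

```python
import numpy as np

# Sketch of Eq. (5): |psi_i> = tensor_k RY(x_ik)|0>, as a numpy statevector.
def ry(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

def angle_encode(x):
    state = np.array([1.0])
    for xk in x:
        qubit = ry(xk) @ np.array([1.0, 0.0])   # RY(x_k)|0>
        state = np.kron(state, qubit)           # tensor product across qubits
    return state

x = np.array([0.3, 1.2, 2.0, 0.7])              # F = 4 features -> 4 qubits
psi = angle_encode(x)                           # 2^4 = 16 amplitudes, unit norm
```

The amplitude of |0…0> is the product of cos(x_k/2) terms, and the state stays normalized, matching the unitary nature of the encoding.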
Rather than iterative message passing, Q-AGNN performs explicit multi-hop aggregation through powers of the adjacency matrix. Let A be the adjacency matrix of the cosine-similarity graph. One-hop and two-hop interactions are obtained as

A^{(1)} = A, \qquad A^{(2)} = A^2.    (9)

The aggregated embeddings are computed as polynomial graph filters:

z_i^{(1)} = \sum_{j \in V} A^{(1)}_{ij} \, \alpha^{(1)}_j \, z_j,    (10)

z_i^{(2)} = \sum_{j \in V} A^{(2)}_{ij} \, \alpha^{(2)}_j \, z_j,    (11)

where z_j is the PQC embedding of neighboring node j. The attention weights \alpha^{(1)} and \alpha^{(2)} are computed at the node level and normalized globally across the graph, emphasizing informative nodes during aggregation. The final node representation combines original and aggregated embeddings:

h_i' = \sigma( z_i + z_i^{(1)} + z_i^{(2)} ),    (12)

where \sigma(·) is a nonlinear activation. A multilayer perceptron produces the intrusion score:

y_i = \mathrm{MLP}(h_i').    (13)

Algorithm 1: Q-AGNN
Input: Graph G = (V, E) with adjacency matrix A, node features {h_i \in \mathbb{R}^F}_{i \in V}, quantum parameters \theta, attention parameters \alpha^{(1)}, \alpha^{(2)}
Output: Node-level intrusion scores {y_i}_{i \in V}
// Quantum Embedding Generation (Node-wise)
1: for each node i \in V do
2:   Encode node features: |\psi_i\rangle = \bigotimes_{k=1}^{F} R_Y(h_{ik}) |0\rangle
3:   Apply parameterized circuit: |\phi_i\rangle = U(\theta) |\psi_i\rangle
4:   Measure observables to obtain embedding: z_i = \langle \phi_i | \hat{O} | \phi_i \rangle
5: end for
// Polynomial Multi-Hop Aggregation
6: Compute one-hop adjacency A^{(1)} = A
7: Compute two-hop adjacency A^{(2)} = A^2
8: for each node i \in V do
9:   One-hop aggregation: z_i^{(1)} = \sum_{j \in V} A^{(1)}_{ij} \alpha^{(1)}_j z_j
10:  Two-hop aggregation: z_i^{(2)} = \sum_{j \in V} A^{(2)}_{ij} \alpha^{(2)}_j z_j
11: end for
// Fusion and Prediction
12: for each node i \in V do
13:   Fuse embeddings: h_i' \leftarrow \sigma( z_i + z_i^{(1)} + z_i^{(2)} )
14:   Predict intrusion score: y_i \leftarrow \mathrm{MLP}(h_i')
15: end for
16: return {y_i}_{i \in V}

This formulation reveals that Q-AGNN implements a second-order polynomial graph filter over quantum-enhanced node embeddings, decoupling nonlinear feature transformation in the quantum domain from efficient relational aggregation in the classical domain. Algorithm 1 summarizes the forward pass of the proposed Q-AGNN model. The hybrid quantum-classical design of Q-AGNN is motivated by the complementary strengths of quantum computation and classical graph learning. Parameterized EfficientSU2 circuits induce highly nonlinear transformations in a high-dimensional Hilbert space, where quantum superposition enables compact representation of multiple feature interactions and entanglement captures higher-order correlations. These properties are well suited to intrusion detection, in which malicious behavior often manifests through subtle, non-linear dependencies among traffic statistics, temporal dynamics, and protocol attributes. At the same time, current quantum hardware operates in the NISQ regime ([27]), with limited qubit counts, shallow circuit depths, and non-negligible noise. Fully quantum graph learning pipelines are therefore not yet practical. Q-AGNN adopts a hybrid architecture in which PQCs serve as expressive feature encoders, while classical attention-based aggregation and prediction layers provide scalability, numerical stability, and efficient optimization. This design supports end-to-end training on classical hardware, noisy simulators, and near-term quantum processors without assuming fault-tolerant quantum computation. From a longer-term perspective, Q-AGNN represents a transitional architecture toward fully quantum graph learning. As scalable fault-tolerant quantum systems mature, increasingly larger components of the pipeline, including neighborhood aggregation and attention computation, may be migrated to the quantum domain.
In such settings, quantum superposition could enable parallel exploration of multiple neighborhood configurations, while entanglement may naturally encode multi-hop relational structure without explicit iterative message passing. The proposed framework is therefore both practically viable in the near term and aligned with the anticipated evolution of quantum computing.

3.1 Design Considerations

The architectural design of Q-AGNN is grounded in established principles from quantum feature maps and graph signal processing, with the objective of achieving high representational expressivity while ensuring numerical stability and efficient optimization under realistic computational constraints. Let x_v \in \mathbb{R}^d denote the classical feature vector associated with node v. Each node feature is encoded by a parameterized quantum circuit (PQC) U(x_v, \theta) acting on n qubits, producing a quantum state

|\psi(x_v)\rangle = U(x_v, \theta) |0\rangle^{\otimes n}.    (14)

Node embeddings are obtained by measuring expectation values of Pauli observables {P_k},

z_{v,k} = \langle \psi(x_v) | P_k | \psi(x_v) \rangle,    (15)

yielding a bounded embedding z_{v,k} \in [-1, 1] for all v and k. This boundedness is an intrinsic property of quantum expectation values and ensures that the resulting feature representations remain uniformly bounded when composed with classical neural layers. Consequently, Q-AGNN avoids uncontrolled activation growth and supports numerically stable gradient-based optimization. Rather than employing deep iterative message passing, Q-AGNN performs explicit aggregation over first- and second-order neighborhoods using powers of the adjacency matrix. Let A \in \mathbb{R}^{N \times N} denote the adjacency matrix of the cosine-similarity graph constructed over network flows. The aggregation operator is defined over A and A^2, corresponding to one-hop and two-hop neighborhoods, respectively.
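The one- and two-hop aggregation of Eqs. (9)-(12) can be sketched classically by stubbing the PQC embeddings with bounded random values (a toy numpy illustration; the graph, attention scores, and tanh activation are our illustrative choices, not the paper's released code):

```python
import numpy as np

# Toy sketch of Eqs. (9)-(12): attention-weighted multi-hop aggregation.
rng = np.random.default_rng(1)
N, dq = 5, 3
A = (rng.random((N, N)) < 0.4).astype(float)
A = np.triu(A, 1); A = A + A.T                  # undirected similarity graph
A2 = A @ A                                      # two-hop adjacency, Eq. (9)

Z = rng.uniform(-1, 1, size=(N, dq))            # stand-in PQC embeddings in [-1, 1]
s1, s2 = rng.normal(size=N), rng.normal(size=N)
alpha1 = np.exp(s1) / np.exp(s1).sum()          # node-level attention, globally normalized
alpha2 = np.exp(s2) / np.exp(s2).sum()

Z1 = A @ (alpha1[:, None] * Z)                  # one-hop aggregation, Eq. (10)
Z2 = A2 @ (alpha2[:, None] * Z)                 # two-hop aggregation, Eq. (11)
H = np.tanh(Z + Z1 + Z2)                        # fusion with sigma = tanh, Eq. (12)
```

Note that the attention is node-level (one weight per node, normalized over the graph) rather than edge-level, matching the text's cost argument about avoiding repeated quantum evaluations per edge.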
This formulation aligns with polynomial graph filtering approaches in graph signal processing, where higher-order structural information is captured through low-degree polynomials of the graph shift operator rather than deep propagation ([8, 21]). From this perspective, the aggregation step in Q-AGNN can be expressed as a second-order polynomial graph filter applied to the PQC-generated embeddings:

H = ( \beta_0 I + \beta_1 A + \beta_2 A^2 ) Z,    (16)

where Z \in \mathbb{R}^{N \times d_q} is the matrix of quantum embeddings, and {\beta_k}_{k=0}^{2} are implicit, data-dependent coefficients induced by the attention mechanism. Polynomial filters of this form are known to balance locality and contextual awareness while avoiding the excessive smoothing associated with deep message passing ([26, 13, 22]). By restricting aggregation to low-order neighborhoods, Q-AGNN preserves discriminative local structure while incorporating sufficient relational context for intrusion detection. The attention mechanism further refines this filtering process by introducing adaptive, node-level weighting. Let \alpha_v denote the learned attention weight associated with node v, normalized across the graph. These weights modulate the contribution of each node during aggregation, effectively reweighting the polynomial filter coefficients in a data-dependent manner. Unlike edge-wise attention mechanisms, this node-centric formulation reduces computational complexity and is well suited to PQC-based embeddings, where repeated quantum evaluations are costly. This design is particularly advantageous in similarity-based traffic graphs, where benign and malicious flows may occupy overlapping neighborhoods and selective emphasis is required. Crucially, Q-AGNN adopts a hybrid quantum-classical decomposition that cleanly separates nonlinear feature transformation from relational reasoning.
The PQC is evaluated once per node to generate expressive embeddings, while all neighborhood aggregation, attention weighting, and classification are performed classically. This separation avoids repeated quantum circuit evaluations during message passing and enables efficient end-to-end training using standard optimization methods on classical hardware, noisy simulators, or near-term quantum devices.

3.2 Theoretical Foundations: Quantum Feature Maps and Expressivity

The design of Q-AGNN is grounded in principles from quantum feature mapping and kernel theory, which provide insight into the representational advantages of parameterized quantum circuits (PQCs) over classical feature transformations.

3.2.1 PQC as an Implicit Quantum Feature Map

Let x \in \mathbb{R}^d denote the classical feature vector associated with a network flow. The PQC used in Q-AGNN defines a nonlinear mapping

\Phi_\theta : \mathbb{R}^d \to \mathcal{H}, \qquad x \mapsto |\psi(x; \theta)\rangle,    (17)

where \mathcal{H} is a Hilbert space of dimension 2^n for an n-qubit system. This mapping is realized via angle encoding followed by a parameterized EfficientSU2 ansatz. Node embeddings are obtained through expectation values of a set of Pauli observables {P_k}:

z_k(x) = \langle \psi(x; \theta) | P_k | \psi(x; \theta) \rangle.    (18)

These expectation values define coordinates of x under an implicit quantum feature map. The kernel function induced by this measurement strategy between two inputs x and x' is

K_\theta(x, x') = \sum_k \langle \psi(x; \theta) | P_k | \psi(x; \theta) \rangle \cdot \langle \psi(x'; \theta) | P_k | \psi(x'; \theta) \rangle.    (19)

This is the Pauli expectation kernel, which measures similarity through the alignment of observable measurement outcomes in the quantum feature space. We note this is distinct from the fidelity kernel

K_\theta^{\mathrm{fid}}(x, x') = | \langle \psi(x; \theta) | \psi(x'; \theta) \rangle |^2,    (20)

which measures quantum state overlap directly.
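The distinction between Eqs. (19) and (20) is easy to check numerically in the smallest case. The sketch below (ours, not from the paper) compares both kernels for two single-qubit states prepared by RY rotations; for a single qubit the full Pauli set {X, Y, Z} is tomographically complete, and for pure states the two kernels are then affinely related by K = 2 K^fid − 1:

```python
import numpy as np

# Pauli expectation kernel (Eq. 19) vs. fidelity kernel (Eq. 20), 1 qubit.
def ry_state(theta):
    return np.array([np.cos(theta / 2), np.sin(theta / 2)])

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]])
Z = np.array([[1, 0], [0, -1]], dtype=complex)

def expvals(psi):
    return np.array([np.real(psi.conj() @ P @ psi) for P in (X, Y, Z)])

psi_a, psi_b = ry_state(0.4), ry_state(1.1)
k_pauli = expvals(psi_a) @ expvals(psi_b)        # alignment of measurement outcomes
k_fid = np.abs(psi_a.conj() @ psi_b) ** 2        # direct state overlap
```

With a smaller, fixed observable set (as in Q-AGNN's finite Pauli measurements), this exact relationship no longer holds, which is the "projected view" discussed next.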
While both kernels are positive semi-definite and theoretically valid, Q-AGNN implicitly operates under the Pauli expectation kernel, since node embeddings are derived from observable measurements rather than state inner products. The two kernels coincide only in special cases, such as when the observable set {P_k} forms a complete basis for the operator space, i.e., when tomographically complete measurements are performed. In the general case with a finite, fixed set of Pauli observables, the Pauli expectation kernel captures a projected view of the full quantum state geometry, sufficient for learning discriminative representations while remaining computationally tractable on near-term hardware.

3.2.2 Expressivity Compared to Classical MLPs

A classical multilayer perceptron (MLP) with ReLU or sigmoid activations implements piecewise-linear function approximation whose expressivity grows polynomially with width and depth. In contrast, PQCs composed of parameterized rotations and entangling gates generate functions of the form

f(x) = \langle 0 | U^\dagger(x, \theta) \hat{O} U(x, \theta) | 0 \rangle,    (21)

which, following Schuld et al. ([32]), can be expressed as truncated Fourier series over the input features:

f(x) = \sum_{\omega \in \Omega} c_\omega \, e^{i \omega \cdot x},    (22)

where \Omega denotes the set of accessible frequency components and c_\omega are complex coefficients determined by the trainable parameters \theta. Critically, the accessible frequency spectrum \Omega is not arbitrary; it is determined by the eigenvalue spectrum of the encoding Hamiltonian. Under R_Y angle encoding, each qubit contributes a generator with eigenvalues {-1/2, +1/2}, yielding a frequency spectrum bounded by

\Omega \subseteq \{ -n/2, \ldots, 0, \ldots, n/2 \}    (23)

for an n-qubit system. In Q-AGNN, with n = 4 qubits, the maximum accessible frequency per input feature is therefore ±2, corresponding to second-order trigonometric interactions of the form

\{ 1, \sin(x_i), \cos(x_i), \sin(2x_i), \cos(2x_i) \}.    (24)

Cross-feature interactions arise through entangling CNOT gates, which introduce multiplicative coupling between qubits, enabling terms such as

\sin(x_i)\cos(x_j), \quad \cos(x_i)\cos(x_j).    (25)

However, the total number of accessible Fourier coefficients grows as O(n^2) under the linear entangling topology rather than exponentially, due to the constrained connectivity of the EfficientSU2 ansatz employed. This represents a meaningful but bounded expressivity advantage over a shallow classical MLP of equivalent parameter count, rather than an unrestricted exponential advantage. Furthermore, the concentration of measure phenomenon implies that in high-dimensional quantum systems, random parameter initializations tend to produce expectation values concentrated near zero, potentially leading to flat optimization landscapes (barren plateaus) during training. For the shallow, 4-qubit circuits used in Q-AGNN, this effect is limited but non-negligible and is partially mitigated by the hybrid classical optimization pipeline. While PQCs introduce a structurally distinct inductive bias compared to classical MLPs by embedding feature interactions through unitary evolution rather than layered nonlinear compositions, the practical expressivity advantage in Q-AGNN is more accurately described as moderate and dependent on circuit depth rather than inherently superior. This consideration is especially relevant in the four-qubit shallow-circuit regime imposed by current NISQ hardware constraints. It also motivates future investigation of deeper parameterized circuits and alternative encoding strategies as fault-tolerant quantum hardware becomes available.

3.2.3 Boundedness and Optimization Stability

Expectation values of Pauli observables satisfy

z_k(x) \in [-1, 1],    (26)

ensuring that quantum embeddings remain uniformly bounded.
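Both the truncated Fourier structure of Eq. (22) and the boundedness of Eq. (26) are visible in the smallest instance of Eq. (21): a single qubit with RY angle encoding and observable Z, for which f(x) = cos x exactly (a degree-1 trigonometric polynomial, with |f(x)| ≤ 1). The numerical check below is our own sketch, not the paper's code:

```python
import numpy as np

# Smallest case of Eq. (21): f(x) = <0| RY(x)^dag Z RY(x) |0> = cos(x).
def ry(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

Z = np.array([[1.0, 0.0], [0.0, -1.0]])
ket0 = np.array([1.0, 0.0])

def f(x):
    psi = ry(x) @ ket0
    return psi @ Z @ psi                 # bounded Pauli expectation value

xs = np.linspace(-np.pi, np.pi, 7)
vals = np.array([f(x) for x in xs])      # matches cos(xs), all within [-1, 1]
```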
This boundedness prevents uncontrolled activation growth and promotes numerical stability when the embeddings are composed with classical graph filters. Unlike classical deep networks, which may require explicit normalization strategies to mitigate exploding or vanishing activations, PQC-based embeddings inherently preserve bounded magnitudes due to the unitary nature of quantum evolution. This boundedness property provides an additional practical advantage under realistic NISQ conditions. Depolarizing noise, the dominant error channel on current quantum hardware, drives quantum states toward the maximally mixed state and causes expectation values of Pauli observables to decay toward zero approximately by a factor of (1−p)^d, where p denotes the depolarizing error rate per gate and d the circuit depth. Since PQC embeddings in Q-AGNN are inherently bounded within the interval [−1, 1], this noise-induced attenuation does not generate out-of-distribution outputs. Instead, it produces embeddings that remain within the same bounded range encountered during training, albeit with reduced magnitude. As a result, the downstream attention mechanism and polynomial graph filter receive inputs whose distributional support is preserved under noise, even when individual embedding magnitudes are attenuated. In contrast, classical deep networks with unbounded activations may exhibit large deviations under hardware-induced perturbations, as activation magnitudes are not inherently constrained. The empirical consistency observed between noiseless and noisy simulation results in Section 6.4 aligns with this theoretical expectation.

3.2.4 Interaction with Polynomial Graph Filtering

Q-AGNN applies a second-order polynomial graph filter over the PQC embeddings:

H = (β₀ I + β₁ A + β₂ A²) Z, (27)

where Z contains the PQC-generated node embeddings.
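As a hedged illustration of Eq. (27), the filter can be sketched in a few lines of numpy. The adjacency matrix and β values below are toy assumptions for the example; in the actual model the β coefficients are learned and A may be preprocessed (e.g., normalized):

```python
import numpy as np

def polynomial_graph_filter(A, Z, betas):
    """Second-order polynomial graph filter H = (b0*I + b1*A + b2*A^2) Z,
    applied to (bounded) PQC-derived node embeddings Z."""
    n = A.shape[0]
    F = betas[0] * np.eye(n) + betas[1] * A + betas[2] * (A @ A)
    return F @ Z

# Toy example: 3-node path graph, 2-dimensional embeddings in [-1, 1].
A = np.array([[0., 1., 0.],
              [1., 0., 1.],
              [0., 1., 0.]])
Z = np.array([[ 0.5, -0.2],
              [-0.1,  0.8],
              [ 0.9,  0.3]])
H = polynomial_graph_filter(A, Z, betas=(1.0, 0.5, 0.25))
# The A^2 term reaches two-hop neighbours: node 0 now mixes in node 2's
# features even though they are not directly connected.
```

Because each filter term is a fixed polynomial in A, the aggregation needs no further quantum circuit evaluations, which matches the decoupling described in the text.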
From a kernel perspective, this corresponds to performing relational smoothing in a quantum-induced feature space. The architecture therefore decouples nonlinear feature transformation (quantum domain) from relational aggregation (classical graph filtering). This separation avoids repeated quantum circuit evaluations during message passing while preserving expressive nonlinear embeddings. The resulting model can thus be interpreted as performing kernelized graph filtering with a learnable quantum feature map.

3.2.5 Implications for Intrusion Detection

Intrusion detection requires modeling nonlinear dependencies across traffic statistics and relational propagation across communication patterns. The PQC provides expressive nonlinear transformations capable of capturing higher-order correlations, while the polynomial graph filter propagates these representations across the network structure. Consequently, Q-AGNN combines:
• exponential Hilbert-space embedding capacity,
• bounded expectation-value representations,
• efficient low-order graph signal propagation.
This theoretical foundation explains the empirical improvements observed across multiple datasets and clarifies the structural advantages of hybrid quantum-classical graph learning for cybersecurity applications.

4 Experimental Setup

This section presents a rigorous experimental framework and a set of diverse benchmark datasets used to comprehensively evaluate the effectiveness of the proposed Q-AGNN model. Within this framework, experiments are conducted on four representative network traffic datasets selected to capture both analytical complexity and practical deployment considerations. The selected datasets span multiple levels of data granularity, ranging from feature-rich flow representations derived from packet captures to compact NetFlow summaries commonly adopted in operational intrusion detection systems.
This design choice enables a comprehensive assessment of Q-AGNN's ability to learn discriminative structural patterns while maintaining robustness and efficiency under realistic computational constraints.
• BoT-IoT ([18]): Generated in a controlled cyber range environment at UNSW Canberra to emulate realistic Internet-of-Things (IoT) botnet scenarios, this dataset combines benign traffic with diverse attack behaviors originating from compromised IoT devices. To facilitate experimental tractability, the dataset authors provide a representative subset comprising 5% of the full dataset, along with the 10 most discriminative features, while preserving attack diversity. This author-provided subset is used in all experiments in this study.
• UNSW-NB15 ([23]): A large-scale benchmark dataset created using the IXIA PerfectStorm traffic generator. It consists of labeled network flow records characterized by rich statistical and protocol-level features. Its heterogeneity and complexity make it particularly suitable for evaluating Q-AGNN's capacity to capture subtle attack signatures and complex inter-feature dependencies.
• NetFlow Variants (NF-BoT-IoT, NF-UNSW-NB15) ([30]): NetFlow-based abstractions of the original BoT-IoT and UNSW-NB15 datasets, in which packet-level traffic is summarized into a compact set of canonical flow attributes. This representation reflects realistic monitoring conditions in operational networks and is used to assess the scalability and robustness of Q-AGNN under lightweight data representations.
Together, these four datasets provide a balanced and realistic evaluation setting, enabling a systematic examination of the performance, generalization capability, and practical applicability of Q-AGNN across heterogeneous traffic representations and deployment scenarios.

4.1 Data Preprocessing

Data preprocessing is a critical component of the experimental pipeline.
All features containing missing values (NaNs) are removed to ensure data consistency and reliability. Categorical attributes are converted into numerical representations using the LabelEncoder provided by the scikit-learn library. To enable execution on quantum simulators and mitigate computational overhead, traffic records are aggregated based on unique source IP-destination IP pairs. For each such pair, numerical feature values are summarized by their mean, while the associated class label is determined by the most frequently occurring label within the group. This grouping step makes the data more manageable for quantum simulation while retaining the representative traffic behavior needed for evaluation. After aggregation, all features are scaled using Min-Max normalization to map values into a uniform range. Given the limitations of NISQ devices and the exponential cost of quantum circuit simulation, Principal Component Analysis (PCA) is subsequently applied to reduce the feature space to four dimensions, matching the practical input constraints of the quantum model. The resulting dataset is then partitioned into 70% training, 15% validation, and 15% testing splits to support robust model training, hyperparameter tuning, and unbiased performance evaluation.

4.2 Graph Construction

The representation of network traffic as a graph is a critical component of the proposed Q-AGNN framework. In this approach, each node in the graph corresponds to a single network flow, defined by a unique source-destination IP address pair ([11]). Let x_i ∈ ℝ^F denote the feature vector associated with node i. Edges between nodes are established based on feature-level similarity rather than physical proximity. Specifically, the cosine similarity between node feature vectors is computed, and a bi-directional edge is introduced whenever the similarity exceeds a threshold of 0.9. Formally, an edge e_ij ∈ E exists if

(x_i · x_j) / (‖x_i‖ ‖x_j‖) ≥ 0.9.
(28)

Figure 2: Illustration of the network flow graph construction. Each node represents a unique flow, and edges connect nodes with highly similar feature vectors.

Table 2: Graph statistics for training, validation, and testing sets.
Dataset | Train Nodes | Train Edges | Val Nodes | Val Edges | Test Nodes | Test Edges
BoT-IoT | 109 | 1,392 | 23 | 66 | 24 | 40
NF-BoT-IoT | 126 | 5,870 | 27 | 296 | 28 | 324
UNSW-NB15 | 217 | 11,086 | 47 | 392 | 47 | 578
NF-UNSW-NB15 | 205 | 8,752 | 44 | 496 | 45 | 456

Fig. 2 presents a simple example of the resulting graph. Consider two network flows between IP address pairs 10.0.0.5-10.0.0.12 and 10.0.1.8-10.0.1.20, represented by feature vectors x_0 and x_1, respectively. These flows are mapped to nodes in the graph, and a bi-directional edge is formed if their feature vectors exhibit sufficient cosine similarity. This allows the model to capture relationships between flows that share similar behavioral patterns, even if they originate from distinct endpoints. This similarity-driven graph construction enables Q-AGNN to effectively model interactions among network flows, facilitating the learning of discriminative representations for intrusion detection. A summary of the resulting graph statistics for each dataset is provided in Table 2.

5 Training Configuration

The proposed quantum-classical hybrid model was implemented using Python 3.11.7 with PyTorch 2.5.1+cu121, Qiskit 2.2.3, and Qiskit Machine Learning 0.9.0, enabling efficient GPU-accelerated computation and seamless integration of quantum neural network layers. Model training employed the Adam optimizer with a learning rate of 0.01 and an L2 regularization (weight decay) of 1×10⁻². The maximum number of epochs was set to 1000, and early stopping was applied if the validation loss did not improve for 30 consecutive epochs, to prevent overfitting. During training, the Binary Cross-Entropy with Logits loss function was used to optimize the model for binary anomaly detection.
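The thresholded cosine-similarity rule of Eq. (28) can be sketched as follows. This is a minimal numpy illustration with made-up feature vectors, not the exact implementation used in the experiments:

```python
import numpy as np

def build_similarity_graph(X, threshold=0.9):
    """Adjacency matrix with an edge wherever cosine similarity between
    two flow feature vectors meets the threshold (cf. Eq. (28)).
    X: (n_nodes, n_features) array of aggregated, normalized flow features."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.clip(norms, 1e-12, None)   # guard against zero vectors
    S = Xn @ Xn.T                          # pairwise cosine similarities
    A = (S >= threshold).astype(float)
    np.fill_diagonal(A, 0.0)               # no self-loops
    return A

# Two near-parallel flows and one dissimilar flow (hypothetical values).
X = np.array([[1.0, 0.0, 0.2, 0.1],
              [0.9, 0.1, 0.2, 0.1],
              [0.0, 1.0, 0.0, 0.9]])
A = build_similarity_graph(X, threshold=0.9)
# A connects nodes 0 and 1 (cosine similarity ~0.99) but leaves node 2 isolated.
```

Because S is symmetric, every edge that passes the threshold is automatically bi-directional, matching the construction described above.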
For evaluation, the logits were passed through a sigmoid activation, with a threshold of 0.5 used to classify instances as anomalous or benign. Quantum computations were performed using different backends depending on the experiment. For noiseless simulations, the StatevectorEstimator was employed to obtain exact expectation values. To assess model performance under realistic noise conditions, an actual IBM Quantum backend was randomly selected from the available operational devices for training. The noise profile from this backend was imported into the AerSimulator (using the density-matrix method) to construct a noisy quantum simulator, which was then used with BackendEstimatorV2 and a default precision of 0.03. For testing, the same backend and corresponding noise model were used to ensure consistency and reproducibility of the results. This setup enabled a controlled evaluation of the impact of quantum noise on model performance. For comparison, baseline models including GCN, GAT, GraphSAGE, ClusterGCN, GINConv, SuperGAT, and TransformerConv were trained under identical conditions. Each baseline employed two hidden layers to capture two-hop neighborhood information, and the hidden-layer dimensions were selected such that the total number of trainable parameters closely matched that of the proposed quantum-classical framework, ensuring a fair and consistent evaluation across all models.

6 Results and Discussion

In this section, we present a comprehensive evaluation of the proposed Q-AGNN framework for network intrusion detection. Performance is assessed across multiple benchmark datasets, using a range of classical GNN baselines for comparison. We analyze the results under three key experimental settings: (i) ideal, noise-free quantum estimation using the StatevectorEstimator; (ii) noisy quantum simulation with hardware-calibrated noise; and (iii) execution on real IBM quantum hardware.
To evaluate the performance of Q-AGNN, we adopt standard classification metrics commonly used for IDS: accuracy, precision, recall (detection rate), F1-score, false positive rate (FPR), false negative rate (FNR), and specificity, providing a comprehensive assessment of both detection effectiveness and error characteristics. For precision, recall, and F1-score, we report macro-averaged values to weight the normal and attack classes equally. This is particularly important in imbalanced intrusion detection datasets, where the attack class may be underrepresented. In contrast, FPR, FNR, and specificity are reported per class to provide operational insight, with a special focus on minimizing false positives to ensure practical applicability in real-world network monitoring.

6.1 Embedding Space Analysis of PQC vs MLP

Figure 3: t-SNE visualization comparing embeddings produced by MLP and PQC. (a)-(b): BoT-IoT dataset (MLP vs PQC). (c)-(d): NF-BoT-IoT dataset (MLP vs PQC). (e)-(f): UNSW-NB15 dataset (MLP vs PQC). (g)-(h): NF-UNSW-NB15 dataset (MLP vs PQC). Green points represent normal traffic and red points represent attack traffic.

Before evaluating performance across the different quantum execution settings, we first analyze the geometry of the node embeddings learned by the model. Fig. 3 presents t-SNE visualizations of these embeddings across all datasets for both the classical MLP-based model and the PQC-based model. A consistent contrast is observed between the two. The embeddings produced by the MLP are widely dispersed, with red and green points scattered across the space and frequently overlapping. The distribution appears noisy and unstructured, with no clear formation of compact regions corresponding to a specific class.
In many areas, samples from different classes occupy the same neighborhoods, indicating that the MLP does not learn a representation space in which similar behaviors are grouped together. Instead, it leaves much of the burden of discrimination to the final classifier layer rather than embedding the separability directly into the feature geometry. In contrast, the PQC embeddings exhibit a markedly different spatial organization. Points belonging to the same class form visibly compact clusters, while samples from different classes are pushed into distinct regions of the space with noticeable gaps between them. The clusters are not only tighter but also more geometrically coherent, suggesting that the PQC transformation reshapes the feature space in a way that naturally groups similar traffic patterns together. This indicates that class separability is already encoded at the representation level, making the subsequent classification task significantly easier. Another notable difference is the reduction in embedding spread. While the MLP embeddings are stretched across large regions of the space, the PQC embeddings occupy more confined and structured areas. This compactness reflects a form of implicit regularization, where the PQC discourages noisy or redundant representations and instead produces well-organized manifolds corresponding to the underlying traffic behaviors.

6.2 Sensitivity to Cosine Similarity Threshold

Table 3: Sensitivity of Q-AGNN to cosine similarity threshold on NF-UNSW-NB15.
Threshold | F1-score | FPR
0.60 | 0.816 | 0.000
0.70 | 0.949 | 0.000
0.80 | 1.000 | 0.000
0.90 | 0.949 | 0.000
0.95 | 0.949 | 0.000

As shown in Table 3, Q-AGNN exhibits stable performance across a broad range of cosine similarity thresholds. Increasing the threshold from 0.60 to 0.70 leads to a substantial improvement in F1-score, indicating that overly dense graphs introduce weak or noisy neighbor relationships that hinder effective aggregation.
Thresholds between 0.70 and 0.95 yield consistently strong performance, with no observed increase in the false positive rate. While the highest F1-score is achieved at a threshold of 0.80, we select 0.90 as the default value to favor sparser graphs with higher-confidence edges. This choice reduces graph density and computational overhead while maintaining stable detection performance, and is therefore better aligned with practical deployment considerations.

6.3 Performance Using Statevector-Based Quantum Estimation

Table 4: Performance of Q-AGNN, GCN, GAT, GraphSAGE, ClusterGCN, GINConv, SuperGAT, and TransformerConv across datasets. Precision, recall, and F1-score are reported as macro-averaged to equally account for both normal and attack classes. Accuracy, FPR, FNR, and specificity are reported as global or per-class metrics. Best metrics are highlighted in bold.
Model | Acc | Prec | Rec | F1 | FPR | FNR | Spec

BoT-IoT
Q-AGNN (Ours) | 1.00 | 1.00 | 1.00 | 1.00 | 0.00 | 0.00 | 1.00
GCN | 0.9167 | 0.9524 | 0.8000 | 0.8500 | 0.00 | 0.4000 | 1.00
GAT | 0.9167 | 0.9524 | 0.8000 | 0.8500 | 0.00 | 0.4000 | 1.00
GraphSAGE | 0.9167 | 0.9524 | 0.8000 | 0.8500 | 0.00 | 0.4000 | 1.00
ClusterGCN | 0.9167 | 0.9524 | 0.8000 | 0.8500 | 0.00 | 0.4000 | 1.00
GINConv | 0.8333 | 0.7778 | 0.8947 | 0.7983 | 0.2105 | 0.00 | 0.7895
SuperGAT | 0.9167 | 0.9524 | 0.8000 | 0.8500 | 0.00 | 0.4000 | 1.00
TransformerConv | 0.9167 | 0.9524 | 0.8000 | 0.8500 | 0.00 | 0.4000 | 1.00

NF-BoT-IoT
Q-AGNN (Ours) | 0.8929 | 0.8106 | 0.8565 | 0.8303 | 0.0870 | 0.2000 | 0.9130
GCN | 0.8929 | 0.8106 | 0.8565 | 0.8303 | 0.0870 | 0.2000 | 0.9130
GAT | 0.7857 | 0.4074 | 0.4783 | 0.4400 | 0.0435 | 1.0000 | 0.9565
GraphSAGE | 0.9643 | 0.9167 | 0.9783 | 0.9434 | 0.0435 | 0.00 | 0.9565
ClusterGCN | 0.9286 | 0.8571 | 0.9565 | 0.8939 | 0.0870 | 0.00 | 0.9130
GINConv | 0.7500 | 0.7083 | 0.8478 | 0.7044 | 0.3043 | 0.00 | 0.6957
SuperGAT | 0.8929 | 0.8333 | 0.7783 | 0.8014 | 0.0435 | 0.4000 | 0.9565
TransformerConv | 0.9286 | 0.8571 | 0.9565 | 0.8939 | 0.0870 | 0.00 | 0.9130

UNSW-NB15
Q-AGNN (Ours) | 0.9787 | 0.9881 | 0.9167 | 0.9485 | 0.00 | 0.1667 | 1.00
GCN | 0.9787 | 0.9881 | 0.9167 | 0.9485 | 0.00 | 0.1667 | 1.00
GAT | 0.9787 | 0.9881 | 0.9167 | 0.9485 | 0.00 | 0.1667 | 1.00
GraphSAGE | 0.9787 | 0.9881 | 0.9161 | 0.9485 | 0.00 | 0.1667 | 1.00
ClusterGCN | 0.9787 | 0.9881 | 0.9161 | 0.9485 | 0.00 | 0.1667 | 1.00
GINConv | 0.8723 | 0.4362 | 0.5000 | 0.4659 | 0.00 | 1.0000 | 1.00
SuperGAT | 0.9787 | 0.9881 | 0.9161 | 0.9485 | 0.00 | 0.1667 | 1.00
TransformerConv | 0.9574 | 0.9045 | 0.9045 | 0.9045 | 0.0244 | 0.1667 | 0.9756

NF-UNSW-NB15
Q-AGNN (Ours) | 0.9111 | 0.9535 | 0.6667 | 0.7256 | 0.00 | 0.6667 | 1.00
GCN | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
GAT | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
GraphSAGE | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
ClusterGCN | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
GINConv | 0.1778 | 0.2857 | 0.1026 | 0.1509 | 0.7949 | 1.0000 | 0.2051
SuperGAT | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
TransformerConv | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00

As a baseline, we first evaluated the Q-AGNN model under ideal, noise-free conditions using Qiskit's StatevectorEstimator. This estimator simulates the exact quantum state evolution without incorporating any hardware noise, providing an upper bound on the achievable model performance. Table 4 compares Q-AGNN with several classical GNN baselines across all datasets in this setting. The results show that Q-AGNN is highly competitive overall, achieving the best or tied-best performance on BoT-IoT, UNSW-NB15, and NF-UNSW-NB15. In particular, it attains perfect performance on BoT-IoT and strong results on UNSW-NB15, indicating that the proposed architecture can effectively separate benign and malicious traffic under ideal quantum estimation. For the more challenging NetFlow-based datasets, the results are more nuanced. On NF-BoT-IoT, GraphSAGE achieves the strongest overall performance, exceeding Q-AGNN in both macro-F1 and accuracy, while also obtaining a lower FPR. This suggests that on some datasets, well-established classical GNNs can remain highly competitive, or even superior, under the current experimental setting. However, on NF-UNSW-NB15, Q-AGNN yields the best overall balance among the reported metrics.
Although several classical baselines achieve the same FPR as Q-AGNN, they do so with substantially worse recall and FNR, indicating that they miss a large fraction of attack instances. In contrast, Q-AGNN preserves zero FPR while improving recall and macro-F1, which is more desirable in intrusion detection scenarios where both false alarms and missed attacks matter. The strong performance of Q-AGNN on several datasets, particularly NF-UNSW-NB15, may be attributed to its use of PQC-based feature encoding, which can capture higher-order correlations in network traffic graphs that may be difficult for classical GNNs to represent. When combined with graph aggregation, this representation appears to improve the balance between false positives and missed attacks, as reflected in the favorable recall and macro-F1 scores on the more challenging settings.

Figure 4: Train and validation loss curves for four datasets: (a) BoT-IoT, (b) NF-BoT-IoT, (c) UNSW-NB15, (d) NF-UNSW-NB15.

Fig. 4 illustrates the training and validation loss dynamics of Q-AGNN across four datasets with varying graph characteristics. Distinct convergence behaviors are observed: rapid and smooth convergence for structurally cleaner graphs (NF-BoT-IoT and UNSW-NB15), slower stabilization for noisier IoT traffic graphs (BoT-IoT), and notably delayed improvement for the sparse and challenging NF-UNSW-NB15 graph. This variation indicates that the model adapts to the underlying graph complexity rather than memorizing features. Importantly, in all cases the validation loss closely follows the training loss, demonstrating stable generalization and the absence of overfitting while learning higher-order neighborhood representations. These results establish a strong performance benchmark in an ideal, noise-free setting. They also indicate the potential benefits of harnessing the full capacity of quantum computation.
As fault-tolerant quantum hardware becomes available, larger graphs and more expressive quantum embeddings can be processed, potentially further improving both detection accuracy and robustness. In essence, the current results provide empirical evidence that the Q-AGNN framework is well positioned to scale with future quantum computational resources while maintaining operationally meaningful low false positive rates.

6.4 Performance Under Noisy Quantum Simulation with Hardware-Calibrated Noise

To assess the robustness of the proposed Q-AGNN under realistic NISQ conditions, we evaluated the model on a noisy quantum simulator whose noise characteristics were imported from actual IBM quantum hardware. Table 5 summarizes the performance of the Q-AGNN model under noisy simulation settings across multiple datasets.

Table 5: Performance of Q-AGNN under noisy simulation with hardware-calibrated noise (HW). Precision, recall, and F1-score are reported as macro-averaged to equally account for both normal and attack classes. Accuracy, FPR, FNR, and specificity are reported as global or per-class metrics.
Model | Dataset | Acc | Prec | Rec | F1 | FPR | FNR | Spec | HW
Q-AGNN | BoT-IoT | 0.917 | 0.952 | 0.800 | 0.850 | 0.000 | 0.400 | 1.000 | Torino
Q-AGNN | NF-BoT-IoT | 0.929 | 0.857 | 0.957 | 0.894 | 0.087 | 0.000 | 0.913 | Torino
Q-AGNN | UNSW-NB15 | 0.979 | 0.988 | 0.917 | 0.949 | 0.000 | 0.167 | 1.000 | Torino
Q-AGNN | NF-UNSW-NB15 | 0.867 | 0.433 | 0.500 | 0.464 | 0.000 | 1.000 | 1.000 | Fez

As observed, Q-AGNN with imported hardware noise demonstrates performance largely consistent with the ideal StatevectorEstimator, achieving high accuracy, precision, and F1-score on the BoT-IoT and UNSW-NB15 datasets. Performance degradation is observed on certain datasets, particularly in the NF-UNSW-NB15 scenario, reflecting the impact of realistic quantum noise.
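This degradation pattern is consistent with the bounded-attenuation argument of Section 3.2.3: depolarizing noise shrinks Pauli expectations toward zero by roughly (1−p)^d while keeping them inside [−1, 1]. A minimal single-qubit sketch of that argument follows; the values of p and d are illustrative assumptions, not calibrated to any specific backend:

```python
import numpy as np

def depolarize(rho, p):
    """Single-qubit depolarizing channel: rho -> (1-p) rho + p I/2."""
    return (1 - p) * rho + p * np.eye(2) / 2

def z_expectation(rho):
    """Tr(rho Z) for the Pauli-Z observable."""
    return float(np.real(np.trace(rho @ np.diag([1.0, -1.0]))))

# State with <Z> = 1, passed through d noisy layers with error rate p.
p, d = 0.02, 10
rho = np.diag([1.0, 0.0])
for _ in range(d):
    rho = depolarize(rho, p)
exp_z = z_expectation(rho)

assert np.isclose(exp_z, (1 - p) ** d)   # attenuation factor (1-p)^d
assert -1.0 <= exp_z <= 1.0              # still inside the training range
```

The noisy expectation is merely a rescaled version of the ideal one, which is why attenuated embeddings stay within the distributional support seen during noiseless training.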
Nevertheless, the results suggest that the proposed Q-AGNN framework remains feasible under hardware-calibrated noise conditions, although its robustness appears to be dataset-dependent and should be interpreted with caution when considering execution on actual quantum devices.

6.5 Execution on Real Quantum Hardware

To complement the simulation study, we executed the Q-AGNN framework on actual IBM quantum hardware with the sole objective of validating the practical operability of the proposed pipeline and examining whether the training dynamics observed in simulation persist under real NISQ conditions. This experiment is intended as a feasibility and behavioral validation, not as a performance evaluation.

Figure 5: Training and testing loss per epoch for Q-AGNN executed on IBM quantum hardware and on a noisy simulator using the same hardware-calibrated noise model. (a) Training loss. (b) Testing loss.

Given the limitations of current NISQ devices, including restricted qubit counts, queue times, and limited execution budgets, a reduced subset of the pre-processed UNSW-NB15 dataset was used. Specifically, 8 nodes were randomly sampled for training (5 normal flows and 3 attack flows) and 5 nodes for testing (3 normal flows and 2 attack flows). This minimal configuration ensured reliable execution on hardware while remaining sufficient to observe learning trends. The quantum component was implemented using Qiskit's SamplerQNN with 128 measurement shots to balance statistical stability and hardware cost. Due to execution-time constraints, training was performed for 30 epochs using the Adam optimizer with a learning rate of 0.001. These settings were selected to verify that the end-to-end Q-AGNN training process can be executed on real hardware and exhibits stable convergence behavior. Experiments were run on the least busy available IBM backend (ibm_fez) to reduce queue delays and temporal variations in noise characteristics.
During execution, both training and testing loss values were recorded at each epoch to analyze convergence under realistic device noise. For reference, the hardware-calibrated noise model of the same backend was imported into a noisy simulator, and the model was trained with identical data partitions and hyperparameters. Fig. 5 shows that the loss trajectories obtained from real hardware closely follow those observed in the noisy simulator. The similarity in convergence behavior indicates that the simulator accurately reflects the device characteristics for this workload and, importantly, that the proposed Q-AGNN pipeline functions as intended on actual quantum hardware. These results provide empirical evidence that Q-AGNN can be executed on current NISQ devices and that its learning dynamics remain consistent outside simulation, thereby validating the practical deployability of the framework under real hardware constraints.

7 Ablation Study

To assess the contribution of each component in Q-AGNN, we conduct an ablation study by selectively disabling attention, neighborhood depth, graph structure, or quantum encoding while keeping the other components unchanged.
• PQC without Attention. Retains quantum encoding and two-hop aggregation but removes attention, aggregating neighbors uniformly. This isolates the effect of attention.
• One-Hop Neighborhood with Attention. Considers only immediate neighbors with attention, disabling two-hop aggregation. This evaluates the benefit of higher-order neighborhoods.
• One-Hop Neighborhood without Attention. Restricts aggregation to one-hop neighbors and removes attention, assessing the combined impact of omitting higher-order dependencies and attention.
• Q-AGNN without Graph Structure (Node-wise QNN). Disables all message passing; each node is independently processed via the PQC. This isolates the contribution of quantum feature encoding without relational inductive bias.
• MLP with Attention.
Replaces the PQC with a capacity-matched classical MLP, keeping attention and aggregation intact. This evaluates the gain from quantum encoding versus classical encoding.
• MLP without Attention. Combines a classical MLP with uniform aggregation (no attention), assessing whether performance gains stem from quantum encoding or from classical graph aggregation alone.

Table 6: Comparison of Q-AGNN and ablation variants on the BoT-IoT, NF-BoT-IoT, UNSW-NB15, and NF-UNSW-NB15 datasets. Precision, recall, and F1-score are reported as macro-averaged to equally account for both normal and attack classes. Accuracy, FPR, FNR, and specificity are reported as global or per-class metrics. Best results are highlighted in bold.
Model | Acc | Prec | Rec | F1 | FPR | FNR | Spec

BoT-IoT
Q-AGNN | 1.00 | 1.00 | 1.00 | 1.00 | 0.00 | 0.00 | 1.00
PQC w/o Att | 0.7917 | 0.3958 | 0.5000 | 0.4419 | 0.00 | 1.0000 | 1.00
1-Hop + Att | 0.9167 | 0.8737 | 0.8737 | 0.8737 | 0.0526 | 0.2000 | 0.9474
1-Hop w/o Att | 0.7917 | 0.3958 | 0.5000 | 0.4419 | 0.00 | 1.0000 | 1.00
Node-wise QNN | 0.7917 | 0.3958 | 0.5000 | 0.4419 | 0.00 | 1.0000 | 1.00
MLP + Att | 0.8750 | 0.8250 | 0.7737 | 0.7949 | 0.0526 | 0.4000 | 0.9474
MLP w/o Att | 0.8750 | 0.8250 | 0.7737 | 0.7949 | 0.0526 | 0.4000 | 0.9474

NF-BoT-IoT
Q-AGNN | 0.8929 | 0.8106 | 0.8565 | 0.8303 | 0.0870 | 0.2000 | 0.9130
PQC w/o Att | 0.8214 | 0.4107 | 0.5000 | 0.4510 | 0.00 | 1.0000 | 1.00
1-Hop + Att | 0.8929 | 0.8106 | 0.8565 | 0.8303 | 0.0870 | 0.2000 | 0.9130
1-Hop w/o Att | 0.8571 | 0.9259 | 0.6000 | 0.6267 | 0.00 | 0.8000 | 1.00
Node-wise QNN | 0.9286 | 0.8571 | 0.9565 | 0.8939 | 0.0870 | 0.00 | 0.9130
MLP + Att | 0.8929 | 0.8106 | 0.8565 | 0.8303 | 0.0870 | 0.2000 | 0.9130
MLP w/o Att | 0.8214 | 0.4107 | 0.5000 | 0.4510 | 0.00 | 1.0000 | 1.00

UNSW-NB15
Q-AGNN | 0.9787 | 0.9881 | 0.9167 | 0.9485 | 0.00 | 0.1667 | 1.00
PQC w/o Att | 0.8723 | 0.4362 | 0.5000 | 0.4659 | 0.00 | 1.0000 | 1.00
1-Hop + Att | 1.00 | 1.00 | 1.00 | 1.00 | 0.00 | 0.00 | 1.00
1-Hop w/o Att | 0.8723 | 0.4362 | 0.5000 | 0.4659 | 0.00 | 1.0000 | 1.00
Node-wise QNN | 1.00 | 1.00 | 1.00 | 1.00 | 0.00 | 0.00 | 1.00
MLP + Att | 0.9574 | 0.9045 | 0.9045 | 0.9045 | 0.0244 | 0.1667 | 0.9756
MLP w/o Att | 0.8511 | 0.4348 | 0.4878 | 0.4598 | 0.0244 | 1.0000 | 0.9756

NF-UNSW-NB15
Q-AGNN | 0.9111 | 0.9535 | 0.6667 | 0.7256 | 0.00 | 0.6667 | 1.00
PQC w/o Att | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
1-Hop + Att | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
1-Hop w/o Att | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
Node-wise QNN | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
MLP + Att | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00
MLP w/o Att | 0.8667 | 0.4333 | 0.5000 | 0.4643 | 0.00 | 1.0000 | 1.00

Table 6 presents the effect of the individual components in Q-AGNN. First, attention emerges as the most consistently important factor. Removing attention often causes a sharp drop in macro-F1, and in several cases the resulting models exhibit near-degenerate behavior with very high specificity but very poor recall for the attack class. This suggests that uniform aggregation is often insufficient to highlight informative neighbors in these intrusion graphs. Second, the effect of graph propagation is dataset-dependent. On some datasets, restricting the model to one-hop aggregation or even removing message passing entirely leads to only limited degradation, and in a few cases the node-wise or one-hop variants remain highly competitive. This suggests that for those datasets, node attributes already carry strong discriminative signals, and higher-order relational information offers only marginal additional benefit. In contrast, on more challenging settings such as NF-UNSW-NB15, the full graph-based Q-AGNN substantially outperforms all reduced variants, showing that relational context becomes crucial when the detection task is harder. Third, the role of the PQC should be interpreted jointly with the graph architecture rather than in isolation. While the PQC-based variants without attention are not consistently stronger than their classical MLP counterparts, the fairest comparison is between Q-AGNN and MLP with attention, where the aggregation mechanism is held fixed and only the encoder changes.
Under this matched comparison, Q-AGNN achieves equal or better macro-F1 on all datasets, with especially clear gains on BoT-IoT, UNSW-NB15, and NF-UNSW-NB15. This suggests that the quantum encoder is most effective when coupled with attention-guided neighborhood aggregation, rather than as a standalone replacement for a classical feature encoder. Overall, the ablation indicates that Q-AGNN benefits from the interaction of three elements: attention-based neighbor selection, graph-aware aggregation, and quantum feature encoding. Their contributions are not uniform across datasets, but the full combination yields the most robust overall behavior.

8 Conclusion

In this work, we propose Q-AGNN, a hybrid quantum-classical graph neural network for intrusion detection that combines classical message passing with PQC-based node embeddings to capture higher-order correlations in network graphs. Across multiple benchmark datasets, Q-AGNN achieves competitive or superior performance relative to classical GNN baselines while maintaining very low false positive rates, a critical requirement for practical IDS. Experiments with hardware-calibrated noisy simulations demonstrate robustness under realistic NISQ conditions, and execution on IBM quantum hardware confirms practical feasibility despite current device limitations. These results highlight the promise of hybrid quantum-classical graph learning for cybersecurity, with future work aimed at scaling PQC embeddings, enhancing noise mitigation, and exploring federated quantum learning for privacy-preserving intrusion detection.

Declarations

Funding: The authors wish to acknowledge that this research received no external funding or financial support.

References

Ahmed et al. [2025] Ahmed, T., Kashif, M., Marchisio, A., Shafique, M.: A comparative analysis and noise robustness evaluation in quantum neural networks. Scientific Reports 15(1), 33654 (2025)
Abreu et al.
[2025] Abreu, D., Moura, D., Esteve Rothenberg, C., Abelém, A.: Quantumnetsec: Quantum machine learning for network security. International Journal of Network Management 35(4), 70018 (2025) Altaf et al. [2023a] Altaf, T., Wang, X., Ni, W., Liu, R.P., Braun, R.: Ne-gconv: A lightweight node edge graph convolutional network for intrusion detection. Computers & Security 130, 103285 (2023) Altaf et al. [2023b] Altaf, T., Wang, X., Ni, W., Yu, G., Liu, R.P., Braun, R.: A new concatenated multigraph neural network for iot intrusion detection. Internet of Things 22, 100818 (2023) Bilot et al. [2025] Bilot, T., Jiang, B., Li, Z., El Madhoun, N., Al Agha, K., Zouaoui, A., Pasquier, T.: Sometimes simpler is better: A comprehensive analysis of \State-of-the-Art\\Provenance-Based\ intrusion detection systems. In: 34th USENIX Security Symposium (USENIX Security 25), p. 7193–7212 (2025) Cao et al. [2021] Cao, Y., Jiang, H., Deng, Y., Wu, J., Zhou, P., Luo, W.: Detecting and mitigating ddos attacks in sdn using spatial-temporal graph convolutional network. IEEE Transactions on Dependable and Secure Computing 19(6), 3855–3872 (2021) Caville et al. [2022] Caville, E., Lo, W.W., Layeghy, S., Portmann, M.: Anomal-e: A self-supervised network intrusion detection system based on graph neural networks. Knowledge-based systems 258, 110030 (2022) Defferrard et al. [2016] Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. Advances in neural information processing systems 29 (2016) Devale et al. [2025] Devale, S., Seshadri, K., et al.: Design and evaluation of a quantum variational graph auto-encoder using quantum graph neural networks. Quantum Machine Intelligence 7(2), 1–10 (2025) Hdaib et al. [2024] Hdaib, M., Rajasegarar, S., Pan, L.: Quantum deep learning-based anomaly detection for enhanced network security. Quantum Machine Intelligence 6(1), 26 (2024) Hosler et al. 
[2024] Hosler, R., Sundar, A., Zou, X., Li, F., Gao, T.: Graph representation learning on novel feature based graphs for network intrusion detection. In: GLOBECOM 2024-2024 IEEE Global Communications Conference, p. 2425–2430 (2024). IEEE Hoang and Van Linh [2025] Hoang, T.B., Van Linh, D.: Intrusion detection in iot networks using a hybrid quantum-classical graph neural network. In: 2025 2nd International Conference On Cryptography And Information Security (VCRIS), p. 1–6 (2025). IEEE Huang et al. [2024] Huang, K., Wang, Y.G., Li, M., et al.: How universal polynomial bases enhance spectral graph neural networks: Heterophily, over-smoothing, and over-squashing. arXiv preprint arXiv:2405.12474 (2024) Hamilton et al. [2017] Hamilton, W., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. Advances in neural information processing systems 30 (2017) Innan et al. [2024] Innan, N., Sawaika, A., Dhor, A., Dutta, S., Thota, S., Gokal, H., Patel, N., Khan, M.A.-Z., Theodonis, I., Bennai, M.: Financial fraud detection using quantum graph neural networks. Quantum Machine Intelligence 6(1), 7 (2024) Jiang et al. [2025] Jiang, B., Bilot, T., El Madhoun, N., Al Agha, K., Zouaoui, A., Iqbal, S., Han, X., Pasquier, T.: Orthrus: Achieving high quality of attribution in provenance-based intrusion detection systems. In: Security Symposium (USENIX Sec’25). USENIX (2025) Kipf [2016] Kipf, T.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016) Koroniotis et al. [2019] Koroniotis, N., Moustafa, N., Sitnikova, E., Turnbull, B.: Towards the development of realistic botnet dataset in the internet of things for network forensic analytics: Bot-iot dataset. Future Generation Computer Systems 100, 779–796 (2019) Kumar and Swarnkar [2025] Kumar, R., Swarnkar, M.: Quids: A quantum support vector machine-based intrusion detection system for iot networks. 
Journal of Network and Computer Applications 234, 104072 (2025) Lo et al. [2021] Lo, W.W., Layeghy, S., Sarhan, M., Gallagher, M., Portmann, M.: E-graphsage: A graph neural network based intrusion detection system for iot. arXiv preprint arXiv:2103.16329 (2021) Lingam et al. [2022] Lingam, V., Sharma, M., Ekbote, C., Ragesh, R., Iyer, A., Sellamanickam, S.: A piece-wise polynomial filtering approach for graph neural networks. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, p. 412–452 (2022). Springer Li et al. [2024] Li, J., Zhang, Q., Liu, W., Chan, A.B., Fu, Y.-G.: Another perspective of over-smoothing: Alleviating semantic over-smoothing in deep gnns. IEEE Transactions on Neural Networks and Learning Systems 36(4), 6897–6910 (2024) Moustafa and Slay [2015] Moustafa, N., Slay, J.: Unsw-nb15: a comprehensive data set for network intrusion detection systems (unsw-nb15 network data set). In: 2015 Military Communications and Information Systems Conference (MilCIS), p. 1–6 (2015). IEEE Neto et al. [2025] Neto, E.C.P., Iqbal, S., Buffett, S., Sultana, M., Taylor, A.: Deep learning for intrusion detection in emerging technologies: a comprehensive survey and new perspectives. Artificial Intelligence Review 58(11), 340 (2025) Ning et al. [2025] Ning, A., Li, T.Y., Chen, N.Y.: Quantum graph attention network: A novel quantum multi-head attention mechanism for graph learning. arXiv preprint arXiv:2508.17630 (2025) Oono and Suzuki [2019] Oono, K., Suzuki, T.: Graph neural networks exponentially lose expressive power for node classification. arXiv preprint arXiv:1905.10947 (2019) Preskill [2018] Preskill, J.: Quantum computing in the nisq era and beyond. Quantum 2, 79 (2018) Rahman et al. [2023] Rahman, M.A., Shahriar, H., Clincy, V., Hossain, M.F., Rahman, M.: A quantum generative adversarial network-based intrusion detection system. In: 2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC), p. 
1810–1815 (2023). IEEE Rajendran et al. [2019] Rajendran, R., Santhosh Kumar, S., Palanichamy, Y., Arputharaj, K.: Detection of dos attacks in cloud networks using intelligent rule based classification system. Cluster Computing 22(Suppl 1), 423–434 (2019) Sarhan et al. [2020] Sarhan, M., Layeghy, S., Moustafa, N., Portmann, M.: Netflow datasets for machine learning-based network intrusion detection systems. In: International Conference on Big Data Technologies and Applications, p. 117–135 (2020). Springer Sakhnenko et al. [2022] Sakhnenko, A., O’Meara, C., Ghosh, K.J., Mendl, C.B., Cortiana, G., Bernabé-Moreno, J.: Hybrid classical-quantum autoencoder for anomaly detection. Quantum Machine Intelligence 4(2), 27 (2022) Schuld et al. [2021] Schuld, M., Sweke, R., Meyer, J.J.: Effect of data encoding on the expressive power of variational quantum-machine-learning models. Physical Review A 103(3), 032430 (2021) Shahraki et al. [2020] Shahraki, A., Taherkordi, A., Haugen, Ø., Eliassen, F.: A survey and future directions on clustering: From wsns to iot and modern networking paradigms. IEEE Transactions on Network and Service Management 18(2), 2242–2274 (2020) Veličković et al. [2017] Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017) Verdon et al. [2019] Verdon, G., McCourt, T., Luzhnica, E., Singh, V., Leichenauer, S., Hidary, J.: Quantum graph neural networks. arXiv preprint arXiv:1909.12264 (2019) Zheng et al. [2021] Zheng, J., Gao, Q., Lü, Y.: Quantum graph convolutional neural networks. In: 2021 40th Chinese Control Conference (C), p. 6335–6340 (2021). IEEE Zhong et al. [2024] Zhong, M., Lin, M., Zhang, C., Xu, Z.: A survey on graph neural networks for intrusion detection systems: Methods, trends and challenges. Computers & Security 141, 103821 (2024)