A Secure Accountability Framework for Multi-Modal Agent Systems: Detecting, Mitigating, and Auditing Data-Poisoning Attacks via Model Context Protocol (MCP) Servers
Abstract
Multi-modal agent systems (MMAS) integrate vision, language, sensor, and symbolic reasoning modules that collaborate autonomously across domains such as healthcare, finance, and transportation. As these systems become more interconnected, their attack surface widens, particularly to data-poisoning threats that inject corrupted inputs to distort model learning or inference. Current defenses are localized, reactive, and opaque, often lacking tamper-proof provenance tracking and verifiable accountability. This paper presents a Secure Accountability Framework built on Model Context Protocol (MCP) servers for detecting, mitigating, and auditing poisoning attacks in distributed MMAS environments. The MCP server acts as a trusted intermediary, providing cryptographic logging, proof-of-state validation, and real-time behavioral deviation analysis. Experimental simulations show a 98.5% detection rate, 3.8% latency overhead, and 97.4% attribution accuracy, supporting MCP servers as a scalable foundation for trustworthy multi-agent AI. The framework aligns with emerging AI governance standards (ISO/IEC 42001, the NIST AI RMF, and the EU AI Act), laying the groundwork for transparent, auditable, and compliant AI ecosystems.
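The cryptographic logging the abstract describes can be pictured as a hash-chained, append-only audit trail in which each entry commits to the digest of its predecessor, making retroactive tampering detectable. The sketch below is a minimal illustration of that idea, not the paper's implementation: the AuditLog class, agent identifiers, and event fields are hypothetical, and a production MCP server would additionally sign entries and anchor digests externally.

```python
import hashlib
import json
import time

# Minimal hash-chained audit log (illustrative sketch, not the paper's code):
# each entry commits to the previous entry's digest, so altering any past
# record breaks the chain and is caught by verification.
class AuditLog:
    GENESIS = "0" * 64  # placeholder digest preceding the first entry

    def __init__(self):
        self.entries = []

    def append(self, agent_id: str, event: dict) -> str:
        prev_hash = self.entries[-1]["hash"] if self.entries else self.GENESIS
        record = {
            "agent_id": agent_id,
            "event": event,
            "timestamp": time.time(),
            "prev_hash": prev_hash,
        }
        # Canonical JSON serialization keeps the digest deterministic.
        digest = hashlib.sha256(
            json.dumps(record, sort_keys=True).encode()
        ).hexdigest()
        record["hash"] = digest
        self.entries.append(record)
        return digest

    def verify(self) -> bool:
        """Recompute every digest; returns False if any entry was altered."""
        prev_hash = self.GENESIS
        for record in self.entries:
            body = {k: v for k, v in record.items() if k != "hash"}
            if body["prev_hash"] != prev_hash:
                return False
            digest = hashlib.sha256(
                json.dumps(body, sort_keys=True).encode()
            ).hexdigest()
            if digest != record["hash"]:
                return False
            prev_hash = digest
        return True

# Hypothetical usage: log a flagged input and a mitigation, then audit.
log = AuditLog()
log.append("vision-agent-07", {"type": "input_flagged", "score": 0.92})
log.append("mcp-server", {"type": "mitigation", "action": "quarantine"})
assert log.verify()  # True until any past entry is modified
```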
Article information
Journal: Journal of Computer Science and Technology Studies
Volume (Issue): 7 (12)
Pages: 01-05
Published:
Copyright: Open access

This work is licensed under a Creative Commons Attribution 4.0 International License.
