4/15 (Tue.) 16:15 - 17:00 7F 703

Unveiling the Bias in Language Models: A Path to Stability in Security Assessments

Large Language Models (LLMs) have shown great potential in cybersecurity applications. However, to fully harness their value, inherent biases and stability issues in LLM-driven security assessments must be effectively addressed. This talk will focus on these challenges and present our latest research on improving evaluation frameworks.

Our study analyzes how the order in which options are presented can sway an LLM's judgments during assessment, introducing positional bias. We propose ranking strategies and probabilistic weighting techniques that significantly improve scoring accuracy and consistency. Key topics covered in this talk include experimental design and observations on LLM biases, probability-based weighting adjustments, and methodologies for aggregating results across multiple ranking permutations. Notably, through validation with the G-EVAL dataset, we demonstrate measurable improvements in model evaluation performance.
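The two ideas named above can be illustrated with a minimal sketch. This is not the speaker's actual method; it is an assumed toy implementation of (a) probability-weighted scoring in the spirit of G-EVAL, where the expected rating is computed from the model's probability distribution over rating tokens rather than the single most likely token, and (b) averaging per-option scores over every permutation of option order, so that any bonus an option receives purely from its position in the prompt cancels out. The `biased_scorer` below is a hypothetical stand-in for an LLM judge with an artificial position bias.

```python
import itertools

def weighted_score(token_probs):
    """G-EVAL-style probability weighting: take the expectation over
    the model's distribution of rating tokens (e.g. {1: p1, ..., 5: p5})
    instead of the single most likely rating."""
    total = sum(token_probs.values())
    return sum(rating * p for rating, p in token_probs.items()) / total

def permutation_averaged_score(options, score_fn):
    """Mitigate order bias: score every ordering of the options and
    average each option's score across all permutations, so a bonus
    tied purely to prompt position cancels out."""
    sums = {opt: 0.0 for opt in options}
    perms = list(itertools.permutations(options))
    for perm in perms:
        scores = score_fn(perm)  # one score per option, in perm order
        for opt, s in zip(perm, scores):
            sums[opt] += s
    return {opt: total / len(perms) for opt, total in sums.items()}

# Toy judge with position bias: all options have the same true quality,
# but earlier positions in the prompt receive an inflated score.
true_quality = {"A": 3.0, "B": 3.0, "C": 3.0}

def biased_scorer(perm):
    return [true_quality[opt] + (len(perm) - i) * 0.5
            for i, opt in enumerate(perm)]

# After permutation averaging, the equally good options score equally,
# because each option occupies each position the same number of times.
averaged = permutation_averaged_score(["A", "B", "C"], biased_scorer)
```

In practice, exhaustively enumerating all permutations is only feasible for small option sets; sampling a subset of orderings is a common compromise, and the weighted-score step requires access to the judge model's token log probabilities.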

Whether you are conducting research on language models or working in cybersecurity technology and decision-making, this talk will provide valuable technical insights and practical takeaways.

Chen, Shu-Yuan
SPEAKER
CyCraft Technology
Data Scientist, Data Science

TOPIC / TRACK
AI Security & Safety Forum

LOCATION
Taipei Nangang Exhibition Center, Hall 2
7F 703

LEVEL
General: General sessions explore new cybersecurity knowledge and non-technical topics, ideal for those with limited or no prior cybersecurity knowledge.

SESSION TYPE
Breakout Session

LANGUAGE
Chinese

SUBTOPIC
AI
AI Security
LLM