AI SECURITY · NETWORK SECURITY ENGINEER · APPLIED AI

Pawan Bishwokarma

I write about AI concepts, cybersecurity, network defense, and the projects I build while learning how intelligent systems can be used, secured, and understood.

Recent posts

All posts →

DeepSeek-R1 Explained: Reinforcement Learning, GRPO, and Emergent Reasoning

A college-level breakdown of how DeepSeek-R1 used reinforcement learning and GRPO to incentivize reasoning, why the approach mattered, and how later work on hierarchical reasoning helps explain what may be happening inside RL-trained reasoning models.

deepseek-r1 · reinforcement-learning · grpo · reasoning-models · llms

Introducing SentinelMesh

Why I'm building a security monitoring mesh for AI agents, and what problem it solves.

sentinelmesh · agents · projects

AI Security in 5 Concepts

Five concepts from five years of enterprise security that show up in every serious AI incident.

ai-security · fundamentals