Blog — Pawan Bishwokarma

May 23, 2026

DeepSeek-R1 Explained: Reinforcement Learning, GRPO, and Emergent Reasoning

A college-level breakdown of how DeepSeek-R1 used reinforcement learning and GRPO to incentivize reasoning, why the approach mattered, and how later work on hierarchical reasoning helps explain what may be happening inside RL-trained reasoning models.

deepseek-r1, reinforcement-learning, grpo, reasoning-models, llms

May 1, 2026

Completing SentinelMesh: LSTM Anomaly Detection and Security Dashboarding

How I added LSTM-based traffic-rate anomaly detection, decision fusion, and a dashboard to SentinelMesh.

sentinelmesh, lstm, anomaly-detection, dashboard, iot-security

Apr 25, 2026

Training SentinelMesh: Building a 1D-CNN IoT Attack Classifier

How I trained a 1D convolutional neural network to classify IoT network traffic as benign or malicious.

sentinelmesh, cnn, iot-security, tensorflow, deep-learning

Apr 12, 2026

Designing SentinelMesh: Architecture for an AI-Powered IoT IDS

How I designed a three-layer IoT intrusion detection system combining CNN classification and LSTM anomaly detection.

sentinelmesh, architecture, iot-security, deep-learning

Apr 2, 2026

Introducing SentinelMesh

Why I'm building a security monitoring mesh for AI agents, and what problem it solves.

sentinelmesh, agents, projects

Mar 16, 2026

AI Security in 5 Concepts

Five concepts from five years of enterprise security that show up in every serious AI incident.

ai-security, fundamentals

Mar 13, 2026

Welcome to pawanbk.io

My space for AI concepts, security projects, and personal perspectives on new developments in AI.

about me, ai-security