Skip to content
Work Experiences:
- [Feb’24 – Present] Senior Software Engineer at Microsoft
- At Microsoft, I focus on strengthening the security and reliability of large-scale secret management systems. I designed and developed the CredSMART managed certificate revocation mechanism for Torus, ensuring robust certificate lifecycle management. I’ve improved observability and quality across the secret management ecosystem through targeted feature development and enhanced system reliability by resolving critical bugs, repair items, and PIRs. Additionally, I modernized deployment processes by automating workflows and migrating away from legacy procedures. My role also includes live-site management, where I actively handle CredSMART incidents and have driven emergency rotations for Microsoft-wide exposed certificates. Beyond technical contributions, I mentor and guide team members, helping them successfully deliver their projects.
- [Jul’20 – Jan’24]: Software Engineer-II at Amazon
- At Amazon, I contributed to multiple high-impact projects across AWS and Alexa. I played a key role in implementing internal and customer-facing features for AWS SageMaker, including EMR Cluster Lifecycle Management and Selective Step Execution, both supported through the Python SDK. Beyond SageMaker, I optimized the Alexa Entity Disambiguation pipeline to improve accuracy and performance, and developed features for Amazon Kids’ backend infrastructure and mobile app, enhancing user experience for millions of families. I also worked on optimizing Amazon’s internal time-series database, a critical component for Fulfillment Center operations. In addition to development, I handled on-call responsibilities, ensuring the reliability and availability of these services in production.
- [ May’17 – Aug’17 ] : Intern – Microsoft Azure.
- Linearizability Checker for the “Reliable Collection”: Designed a failure resilient microservice-based system that deploys and initiates a database, performs operations on it, injects failure and at the same time generates log. The log is later fed to “Jepsen”, a well-known linearizability checker to check for any inconsistency.
- [ May’16 – Aug’16 ] : Intern – Microsoft Azure.
- Benchmarking Tool: Developed a benchmarking tool using the microservice-based architecture, that is also inspired by the well-known YCSB benchmarking tool, to measure the performance of the Reliable Collection compared to MongoDB and Cassandra. Microsoft also uses this tool for Regression testing
- [ May’14 – Aug’14 ] : Intern – Huawei Technologies.
- A Lightweight “emergency message” Broadcasting Technique: Developed this
technique to spread emergency messages as fast as possible while reducing the
Broadcast Storm in the Vehicular Ad-Hoc Network. I used Vehicle-to-Vehicle communication where a received emergency message was further relayed based on the
received SNR level. NS3 was used to measure the effectiveness of the approach.
- [ Mar’11 – Jul’13 ] : FULL TIME Research Eng. – Samsung R&D Institute Bangladesh.
- Worked on Processor Specific algorithm optimization.
- Implemented and optimized Image Effect Filters for low powered and low memory devices.
- Worked on the development of “Samsung iStudio”, Samsung’s in-house image editing software
Research Experiences:
- [ Sep’15 – Apr’20 ] : Research Assistant – Distributed Systems (DPRG)
- Scalable Distributed Strong Failure Detector: Developing a scalable and distributed strong failure detector. Since existing scalable failure detectors (e.g. SWIM) are weak, and on the other hand, strong failure detectors (e.g. Virtual Synchrony) are not scalable, my research aims to develop a distributed failure detector to support the both.
- Microsoft Service Fabric: Analyzed and measured the performance of “Microsoft Service Fabric”- a distributed framework for building microservice-based applications (in collaboration with Microsoft).
- CAT – A New Impossibility Theorem: Proposed a new impossibility theorem called
“CAT” that would suit for transactional distributed database/storage systems
(NewSQL), where NewSQL systems need to consider the impossibility theorem in
terms of Contention, Abort Rate, and Throughput.
- [ Jan’14 – Aug ’15] : Research Assistant – Wireless Networking
- An Opportunistic MAC protocol: Developed an efficient wireless MAC protocol that opportunistically enhances throughput and reduces the backoff time.
Teaching Experiences: