IJCOPE Journal

UGC Logo DOI / ISO Logo

International Journal of Creative and Open Research in Engineering and Management

A Peer-Reviewed, Open-Access International Journal Supporting Multidisciplinary Research, Digital Publishing Standards, DOI Registration, and Academic Indexing.
Journal Information
ISSN: 3108-1754 (Online)
Crossref DOI: Available
ISO Certification: 9001:2015
Publication Fee: 599/- INR
Compliance: UGC Journal Norms
License: CC BY 4.0
Peer Review: Double Blind
Volume 02, Issue 05

Published on: May 2026

BIG DATA ANALYTICS IN INDIAN E-COMMERCE: A COMPREHENSIVE CASE STUDY OF FLIPKART'S DATA-DRIVEN ARCHITECTURE AND STRATEGY

Siddhi Raktate

Prof. Avishek Das

PGDM – Business Analytics [ ISMS, PUNE]

Article Status

Plagiarism Passed Peer Reviewed Open Access

Available Documents

Abstract

This paper presents a comprehensive case analysis of Flipkart's big data analytics strategy, examined through the core theoretical and technical frameworks of the discipline. Flipkart, India's largest e-commerce platform with over 500 million registered users and a peak processing capacity of eight million orders per day, generates three to five petabytes of data daily—a scale that necessitates a sophisticated, multi-layered data architecture. Drawing on engineering disclosures, industry benchmarking data, and a proprietary product catalogue dataset of approximately 20,000 records, this study maps Flipkart's operational practices against the five Vs of big data (volume, velocity, variety, veracity, and value), the Hadoop and Apache Spark ecosystems, SQL and NoSQL paradigms, OLAP and OLTP processing models, data warehousing and data lake architectures, ETL pipelines, and all four types of analytics maturity. Three embedded case studies—covering Big Billion Days sale optimisation, the personalised recommendation engine, and Ekart last-mile logistics analytics—illustrate how abstract technical concepts translate into measurable business outcomes, including a 30% reduction in stockouts, a 91% first-attempt delivery success rate, and recommendation-driven revenue contribution of approximately 15% of total gross merchandise value. A structured peer comparison with Amazon India, Myntra, and Meesho benchmarks Flipkart's capabilities against industry leaders and identifies strategic gaps. The paper concludes with seven prioritised recommendations—spanning data lakehouse migration, federated machine learning, graph-based fraud detection, causal inference pricing, vernacular natural language processing, data governance, and edge analytics—to strengthen Flipkart's competitive data position.

Keywords: Big data, E-commerce analytics, Hadoop, Apache Spark, Flipkart, Recommendation systems, Demand Forecasting, Data lake, ETL pipeline, cluster computing, fault tolerance, OLAP, NoSQL, machine learning, data governance

How to Cite this Paper

Raktate, S. (2026). Big Data Analytics in Indian E-Commerce: A Comprehensive Case Study of Flipkart's Data-Driven Architecture and Strategy. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(05). https://doi.org/10.55041/ijcope.v2i5.479

Raktate, Siddhi. "Big Data Analytics in Indian E-Commerce: A Comprehensive Case Study of Flipkart's Data-Driven Architecture and Strategy." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i5.479.

Raktate, Siddhi. "Big Data Analytics in Indian E-Commerce: A Comprehensive Case Study of Flipkart's Data-Driven Architecture and Strategy." International Journal of Creative and Open Research in Engineering and Management 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i5.479.

Search & Index

References


  1. Amazon Science. (n.d.). DeepAR: Probabilistic forecasting with autoregressive recurrent networks. https://www.amazon.science

  2. Business Standard. (2024, October). Flipkart Internet FY24 revenue and financials. Business Standard. https://www.business-standard.com

  3. Chambers, B., & Zaharia, M. (2018). Spark: The definitive guide. O'Reilly Media.

  4. Codd, E. F. (1970). A relational model of data for large shared data banks. Communications of the ACM, 13(6), 377–387. https://doi.org/10.1145/362384.362685

  5. Davenport, T. H., & Harris, J. G. (2007). Competing on analytics: The new science of winning. Harvard Business Review Press.

  6. Databricks. (n.d.). The lakehouse: A new generation of open platforms that unify data warehousing and advanced analytics. https://www.databricks.com/lakehouse

  7. Deccan Founders. (2025, September). Why Flipkart is still making losses. Deccan Founders. https://www.deccanfounders.com

  8. Flipkart Engineering Blog. (n.d.). Big data infrastructure at Flipkart. https://tech.flipkart.com/blog/big-data

  9. Flipkart Tech@Scale Conference. (2022). Recommendation systems at Flipkart [Conference proceedings]. Flipkart.

  10. Gartner. (2024). Magic quadrant for analytics and business intelligence platforms 2024. Gartner. https://www.gartner.com

  11. GrowthJockey. (2025). Flipkart business model analysis 2025. GrowthJockey. https://www.growthjockey.com

  12. Kimball, R., & Ross, M. (2013). The data warehouse toolkit: The definitive guide to dimensional modelling (3rd ed.). Wiley.

  13. Laney, D. (2001). 3D data management: Controlling data volume, velocity, and variety [Research Note]. META Group.

  14. Marr, B. (2015). Big data: Using SMART big data, analytics and metrics to make better decisions and improve performance. Wiley.

  15. Marz, N., & Warren, J. (2015). Big data: Principles and best practices of scalable real-time data systems. Manning Publications.

  16. McKinsey Global Institute. (2023). Big data: The next frontier for innovation, competition, and productivity (2023 update). McKinsey & Company.

Ethical Compliance & Review Process

  • All submissions are screened under plagiarism detection.
  • Review follows editorial policy.
  • Authors retain copyright.
  • Peer Review Type: Double-Blind Peer Review
  • Published on: May 16 2026
CCBYNC

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

View License
Scroll to Top