Abstract
Blockchain has been widely deployed in various fields, such as finance, education, and public services. Blockchain has decentralized mechanisms with persistency and auditability and runs as an immutable distributed ledger, where transactions are jointly performed through cryptocurrency-based consensus algorithms by worldwide distributed nodes. There have been many survey papers reviewing the blockchain technologies from different perspectives, e.g., digital currencies, consensus algorithms, and smart contracts. However, none of them have focused on the blockchain data management systems. To fill in this gap, we have conducted a comprehensive survey on the data management systems, based on three typical types of blockchain, i.e., standard blockchain, hybrid blockchain, and DAG (Directed Acyclic Graph)-based blockchain. We categorize their data management mechanisms into three layers: blockchain architecture, blockchain data structure, and blockchain storage engine, where block architecture indicates how to record transactions on a distributed ledger, blockchain data structure refers to the internal structure of each block, and blockchain storage engine specifies the storage form of data on the blockchain system. For each layer, the works advancing the state-of-the-art are discussed together with technical challenges. Furthermore, we lay out several possible future research directions for the blockchain data management systems.
- [1] . 2019. Blockchain in industries: A survey. IEEE Access 7 (2019), 36500–36515.Google Scholar
Cross Ref
- [2] . 2019. A blockchain-based certificateless public key signature scheme for vehicle-to-infrastructure communication in VANETs. J. Syst. Archit. 99 (2019).Google Scholar
Digital Library
- [3] . 2016. Blockstack: A global naming and storage system secured by blockchains. In USENIX Annual Technical Conference (ATC’16).Google Scholar
- [4] . 2018. A blockchain-based decentralized data storage and access framework for PingER. In IEEE International Conference on Trust, Security and Privacy in Computing and Communications/IEEE International Conference on Big Data Science and Engineering (TrustCom/BigDataSE’18).Google Scholar
Cross Ref
- [5] . 2019. Blockchain security attack: A brief survey. In International Conference on Computing, Communication and Networking Technologies (ICCCNT’19).Google Scholar
Cross Ref
- [6] . 2015. Ripple: Overview and outlook. In 8th International Conference on Trust and Trustworthy Computing (TRUST’15).Google Scholar
Cross Ref
- [7] . 2014. Enabling blockchain innovations with pegged sidechains. http://www.opensciencereview.com/papers/123/enablingblockchain-innovations-with-pegged-sidechains.Google Scholar
- [8] . 2021. Rahasak—scalable blockchain architecture for enterprise applications. J. Syst. Archit. 116 (2021), 102061.Google Scholar
Digital Library
- [9] . 2015. SoK: Research perspectives and challenges for bitcoin and cryptocurrencies. In Symposium on Security and Privacy (SP’15).Google Scholar
- [10] . 2014. The mini-blockchain scheme. White Paper.Google Scholar
- [11] . 2018. The latest gossip on BFT consensus. CoRR abs/1807.04938.Google Scholar
- [12] . 2014. A next-generation smart contract and decentralized application platform. White Paper 3, 37 (2014).Google Scholar
- [13] . 2016. Architecture of the hyperledger blockchain fabric. In Workshop on Distributed Cryptocurrencies and Consensus Ledgers.Google Scholar
- [14] . 2017. An improved P2P file system scheme based on IPFS and Blockchain. In IEEE International Conference on Big Data (BigData’17).Google Scholar
Cross Ref
- [15] . 2021. Block-LSM: An ether-aware block-ordered LSM-tree based key-value storage engine. In International Conference on High Performance Computing and Communications (ICCD’21).Google Scholar
Cross Ref
- [16] . 2016. Byteball: A decentralized system for storage and transfer of value. https://byteball.org/Byteball.pdf.Google Scholar
- [17] . 2016. Bip152: Compact block relay. See https://github. com/bitcoin/bips/blob/master/bip-0152.mediawiki.Google Scholar
- [18] . 2019. Blockchain for Internet of Things: A survey. IEEE Internet Things J. 6, 5 (2019), 8076–8094.Google Scholar
Cross Ref
- [19] . 2018. A low storage room requirement framework for distributed ledger in blockchain. IEEE Access 6 (2018), 22970–22975.Google Scholar
Cross Ref
- [20] . 2019. Jidar: A jigsaw-like data reduction approach without trust assumptions for bitcoin system. In IEEE International Conference on Distributed Computing Systems (ICDCS’19).Google Scholar
Cross Ref
- [21] . 2019. Towards scaling blockchain systems via sharding. In International Conference on Management of Data (SIGMOD’19).Google Scholar
Digital Library
- [22] . 2019. Txilm: Lossy block compression with salted short hashing. CoRR abs/1906.06500 (2019).Google Scholar
- [23] . 2018. IOTA-next generation block chain. Int. J. Eng. Comput. Sci. (IJECS) 7, 4 (2018), 23823–23826.Google Scholar
- [24] . 2017. Blockchain-based system for secure data storage with private keyword search. In IEEE World Congress on Services (SERVICES’17).Google Scholar
- [25] . 2021. Towards a blockchain database for massive IoT workloads. In IEEE International Conference on Data Engineering Workshops (ICDE’21).Google Scholar
Cross Ref
- [26] . 2018. A survey on opportunities and challenges of Blockchain technology adoption for revolutionary innovation. In Proceedings of the 9th International Symposium on Information and Communication Technology (SoICT’18).Google Scholar
Digital Library
- [27] . 2019. BlockchainDB—towards a shared database on blockchains. In International Conference on Management of Data (SIGMOD’19).Google Scholar
Digital Library
- [28] . 2020. A blockchain-based attribute-based signcryption scheme to secure data sharing in the cloud. J. Syst. Archit. 102 (2020).Google Scholar
Digital Library
- [29] . 2019. A survey on privacy protection in blockchain system. J. Netw. Comput. Appl. 126 (2019), 45–58.Google Scholar
Cross Ref
- [30] . 2019. IOTA-based directed acyclic graphs without orphans. CoRR abs/1901.07302 (2019).Google Scholar
- [31] . 2018. Blockchain storage analysis and optimization of bitcoin miner node. In Communications, Signal Processing, and Systems (CSPS’18).Google Scholar
- [32] . 2020. A semi-autonomous distributed blockchain-based framework for UAVs system. J. Syst. Archit. 107 (2020), 101728.Google Scholar
Cross Ref
- [33] . 2011. LevelDB.Google Scholar
- [34] . 2020. Blockchain enabled distributed storage and sharing of personal data assets. In IEEE International Conference on Data Engineering Workshops (ICDE’20).Google Scholar
Cross Ref
- [35] . 2019. Design and optimization for storage mechanism of the public blockchain based on redundant residual number system. IEEE Access 7 (2019), 98546–98554.Google Scholar
Cross Ref
- [36] . 2018. Efficiently processing temporal queries on hyperledger fabric. In IEEE International Conference on Data Engineering (ICDE’18).Google Scholar
Cross Ref
- [37] . 2018. On building efficient temporal indexes on hyperledger fabric. In IEEE International Conference on Cloud Computing (CLOUD’18).Google Scholar
Cross Ref
- [38] . 2018. Jupiter: A blockchain platform for Mobile devices. In IEEE International Conference on Data Engineering (ICDE’18).Google Scholar
Cross Ref
- [39] . 2018. Managing lifetime healthcare data on the blockchain. In IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ ATC/CBDCom/IOP/SCI).Google Scholar
- [40] . 2017. Bitcoin Blockchain Size Over Time. http://blockchain.info/charts/blocks-size.Google Scholar
- [41] . 2020. BZIP: A compact data memory system for UTXO-based blockchains. J. Syst. Archit. 109 (2020), 101809.Google Scholar
Cross Ref
- [42] . 2018. A survey on security and privacy issues of blockchain technology. Math. Found. Comput. 1, 2 (2018), 121.Google Scholar
Cross Ref
- [43] . 2021. An off-the-chain execution environment for scalable testing and profiling of smart contracts. In USENIX Annual Technical Conference (ATC’21).Google Scholar
- [44] . 2018. OmniLedger: A secure, scale-out, decentralized ledger via sharding. In IEEE Symposium on Security and Privacy (SP’18).Google Scholar
- [45] . 2021. Clouseau: Blockchain-based data integrity for HDFS clusters. In IEEE International Conference on Data Engineering (ICDE’21).Google Scholar
Cross Ref
- [46] . 2016. Cosmos: A network of distributed ledgers. https://cosmos.network/whitepaper.Google Scholar
- [47] . 2020. A survey of IoT applications in blockchain systems: Architecture, consensus, and traffic modeling. ACM Computi. Surv. (CSUR) 53, 1 (2020), 1–32.Google Scholar
Digital Library
- [48] . 2015. DagCoin: A cryptocurrency without blocks. White Paper.Google Scholar
- [49] . 2017. Method and system for storage and retrieval of blockchain blocks using galois fields. U.S. Patent No. 9,569,771.Google Scholar
- [50] . 2018. Scaling Nakamoto consensus to thousands of transactions per second. CoRR abs/1805.03870 (2018).Google Scholar
- [51] . 2020. A decentralized blockchain with high throughput and fast confirmation. In USENIX Annual Technical Conference (ATC’20).Google Scholar
- [52] . 2017. A searchable symmetric encryption scheme using BlockChain. CoRR abs/1711.01030 (2017).Google Scholar
- [53] . 2018. Block-secure: Blockchain based scheme for secure P2P cloud storage. Inf. Sci. 465 (2018), 219–231.Google Scholar
Digital Library
- [54] . 2020. A survey on the security of blockchain systems. Future Gener. Comput. Syst. 107 (2020), 841–853.Google Scholar
Digital Library
- [55] . 2017. EtherQL: A query layer for blockchain system. In 1055 International Conference on Database Systems for Advanced Applications (DASFAA’17).Google Scholar
- [56] . 2017. A survey of blockchain security issues and challenges. Int. J. Netw. Secur. 19, 5 (2017), 653–659.Google Scholar
- [57] . 2020. Towards eidetic blockchain systems with enhanced provenance. In IEEE International Conference on Data Engineering Workshops (ICDE’20).Google Scholar
Cross Ref
- [58] . 2019. A survey on security verification of blockchain smart contracts. IEEE Access 7 (2019), 77894–77904.Google Scholar
Cross Ref
- [59] . 2019. LightChain: A lightweight blockchain system for industrial Internet of Things. IEEE Trans. Ind. Inform. 15, 6 (2019), 3571–3581.Google Scholar
Cross Ref
- [60] . 2015. Segregated Witness (Consensus Layer). Bitcoin Core Development Team, Technical Report BIP 141.Google Scholar
- [61] . 2016. A secure sharding protocol for open blockchains. In Computer and Communications Security (SIGSAC’16).Google Scholar
- [62] . 2020. Atomic cross chain swaps via relays and adapters. In [email protected] 2020: Proceedings of the 3rd Workshop on Cryptocurrencies and Blockchains for Distributed Systems.Google Scholar
Digital Library
- [63] . 1975. Hash table methods. ACM Comput. Surv. (CSUR) 7, 1 (1975), 5–19.Google Scholar
Digital Library
- [64] . 2016. Bigchaindb: A scalable blockchain database. White Paper, BigChainDB (2016).Google Scholar
- [65] . 2018. A blockchain database application platform. CoRR abs/1808.05199 (2018).Google Scholar
- [66] . 2019. Bitcoin: A Peer-to-peer Electronic Cash System.
Technical Report . Manubot.Google Scholar - [67] . 2018. What is the Raiden Network. https://raiden.network.Google Scholar
- [68] . 2018. A survey about consensus algorithms used in blockchain. J. Inf. Process. Syst. 14, 1 (2018), 101–128.Google Scholar
- [69] . 2018. Empowering light nodes in blockchains with block summarization. In International Conference on New Technologies, Mobility and Security (NTMS’18).Google Scholar
Cross Ref
- [70] . 2019. An alternative paradigm for developing and pricing storage on smart contract platforms. In IEEE International Conference on Decentralized Applications and Infrastructures (DAPPCON’19).Google Scholar
Cross Ref
- [71] . 2019. VQL: Providing query efficiency and data authenticity in blockchain systems. In IEEE International Conference on Data Engineering Workshops (ICDE’19).Google Scholar
Cross Ref
- [72] . 2018. Erasure code-based low storage blockchain node. In International Conference on Internet of Things (iThings) and Green Computing and Communications (GreenCom) and Cyber, Physical and Social Computing (CPSCom) and Smart Data (SmartData).Google Scholar
Cross Ref
- [73] . 2021. RainBlock: Faster transaction processing in public blockchains. In USENIX Annual Technical Conference (ATC’21).Google Scholar
- [74] . 2019. Scalable and efficient data authentication for decentralized systems. CoRR abs/1909.11590 (2019).Google Scholar
- [75] . 2017. Plasma: Scalable autonomous smart contracts. White Paper (2017), 1–47.Google Scholar
- [76] . 2016. The Bitcoin Lightning Network: Scalable Off-chain Instant Payments.Google Scholar
- [77] . 2018. The tangle. White paper. http://www.descryptions.com/Iota.pdf.Google Scholar
- [78] . 2019. Equilibria in the tangle. Comput. Indust. Eng. 136 (2019), 160–172.Google Scholar
Digital Library
- [79] . 2020. BFT-store: Storage partition for permissioned blockchain via erasure coding. In IEEE International Conference on Data Engineering (ICDE’20).Google Scholar
Cross Ref
- [80] . 2021. A blockchain-based access control scheme with multiple attribute authorities for secure cloud data sharing. J. Syst. Archit. 112 (2021), 101854.Google Scholar
Cross Ref
- [81] . 2018. LeapChain: Efficient blockchain verification for embedded IoT. In International Conference on Computer-Aided Design (ICCAD’18).Google Scholar
Digital Library
- [82] . 2020. Curbing address reuse in the IOTA distributed ledger: A cuckoo-filter-based approach. Trans. Eng. Manage. 67, 4 (2020), 1244–1255.Google Scholar
Cross Ref
- [83] . 2005. Index of learning styles questionnaire. http://www.engr.ncsu.edu/learningstyles/ilsweb.html.Google Scholar
- [84] . 2020. On sharding open blockchains with smart contracts. In IEEE International Conference on Data Engineering (ICDE’20).Google Scholar
Cross Ref
- [85] . 2018. Blockchain: Challenges and applications. In International Conference on Information Networking (ICOIN’18).Google Scholar
Cross Ref
- [86] . 2019. Storage structure of student record based on hyperledger fabric blockchain. In International Conference on Advanced Information Technologies (ICAIT’19).Google Scholar
Cross Ref
- [87] . 2019. Monoxide: Scale out blockchains with asynchronous consensus zones. In USENIX Symposium on Networked Systems Design and Implementation (NSDI’19).Google Scholar
- [88] . 2019. Re-tangle: A ReRAM-based processing-in-memory architecture for transaction-based blockchain. In Proceedings of the International Conference on Computer-Aided Design (ICCAD’19).Google Scholar
Cross Ref
- [89] . 2018. A blockchain-based framework for data sharing with fine-grained access control in decentralized storage systems. IEEE Access 6 (2018), 38437–38450.Google Scholar
Cross Ref
- [90] . 2020. ABACUS: Address-partitioned bloom filter on address checking for uniqueness in IoT blockchain. In International Conference on Computer Aided Design (ICCAD’20).Google Scholar
Digital Library
- [91] . 2019. A survey on consensus mechanisms and mining strategy management in blockchain networks. IEEE Access 7 (2019), 22328–22370.Google Scholar
Cross Ref
- [92] . 2015. Scalability. Bitcoin Wiki.Google Scholar
- [93] . 2014. Metadisk a blockchain-based decentralized file storage application. Technical Report. 1–11.Google Scholar
- [94] 2017. Cryptokitties is causing ethereum network congestion. https://qz.com/1145833/cryptokitties-is-causing-ethereum-network-congestion.Google Scholar
- [95] . 2016. Polkadot: Vision for a heterogeneous multi-chain framework. White Paper.Google Scholar
- [96] . 2019. vChain: Enabling verifiable boolean range queries over blockchain databases. In International Conference on Management of Data (SIGMOD’19).Google Scholar
Digital Library
- [97] . 2018. Section-blockchain: A storage reduced blockchain protocol, the foundation of an autotrophic decentralized storage architecture. In International Conference on Engineering of Complex Computer Systems (ICECCS’18).Google Scholar
Cross Ref
- [98] . 2018. CUB, a consensus unit-based storage scheme for blockchain system. In IEEE International Conference on Data Engineering (ICDE’18).Google Scholar
Cross Ref
- [99] . 2018. Blockchain-based publicly verifiable data deletion scheme for cloud storage. J. Netw. Comput. Appl. 103 (2018), 185–193.Google Scholar
Digital Library
- [100] . 2018. Smart-toy-edge-computing-oriented data exchange based on blockchain. J. Syst. Archit. 87 (2018), 36–48.Google Scholar
Cross Ref
- [101] . 2018. RapidChain: Scaling blockchain via full sharding. In ACM SIGSAC Conference on Computer and Communications Security (CCS’18).Google Scholar
Digital Library
- [102] . 2021. Authenticated keyword search in scalable hybrid-storage blockchains. In IEEE International Conference on Data Engineering (ICDE’21).Google Scholar
Cross Ref
- [103] . 2019. GEM^2-Tree: A gas-efficient structure for authenticated range queries in blockchain. In IEEE International Conference on Data Engineering (ICDE’19).Google Scholar
Cross Ref
- [104] . 2020. Blockchain-based systems and applications: A survey. J. Internet Technol. 21, 1 (2020), 1–14.Google Scholar
- [105] . 2021. Meepo: Sharded consortium blockchain. In IEEE International Conference on Data Engineering (ICDE’21).Google Scholar
Cross Ref
- [106] . 2018. An innovative IPFS-based storage model for blockchain. In International Conference on Web Intelligence (WI’18).Google Scholar
Cross Ref
- [107] . 2017. An overview of blockchain technology: Architecture, consensus, and future trends. In International Congress on Big Data (BigData Congress’17).Google Scholar
- [108] . 2020. HF-BF: A hotness-aware fine-grained bloom filter for unique address checking in IoT blockchain. In International Conference on High Performance Computing and Communications (HPCC’20).Google Scholar
Cross Ref
Index Terms
A Survey of Blockchain Data Management Systems
Recommendations
A comprehensive blockchain technology survey: architecture, applications and challenges
Blockchain technology has evolved to solve the complexity and privacy issues associated with online digital content distribution in the last decade. Several studies have centred on different implementations of blockchain technology, and there is no ...






Comments