Citation: DING Guo-hao, XU Chen, QIAN Wei-ning. Efficient data loading for log-structured data stores[J]. Journal of East China Normal University (Natural Sciences), 2019, (5): 143-158. doi: 10.3969/j.issn.1000-5641.2019.05.012
[1] O'NEIL P, CHENG E, GAWLICK D, et al. The log-structured merge-tree (LSM-tree)[J]. Acta Informatica, 1996, 33(4): 351-385. doi: 10.1007/s002360050048
[2] CHANG F, DEAN J, GHEMAWAT S, et al. Bigtable: A distributed storage system for structured data[C]//OSDI'06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation-Volume 7. 2006: 205-218.
[3] LevelDB[EB/OL]. [2019-06-09]. https://github.com/google/leveldb.
[4] HBase[EB/OL]. [2019-06-09]. http://hbase.apache.org/.
[5] OceanBase[EB/OL]. [2019-06-09]. https://github.com/alibaba/oceanbase/.
[6] TiDB[EB/OL]. [2019-06-09]. https://university.pingcap.com/
[7] COOPER B F, RAMAKRISHNAN R, SRIVASTAVA U, et al. PNUTS: Yahoo!'s hosted data serving platform[J]. Proceedings of the VLDB Endowment, 2008, 1(2): 1277-1288. doi: 10.14778/1454159.1454167
[8] SILBERSTEIN A, COOPER B F, SRIVASTAVA U, et al. Efficient bulk insertion into a distributed ordered table[C]//Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, 2008: 765-778.
[9] SILBERSTEIN A E, SEARS R, ZHOU W, et al. A batch of PNUTS: Experiences connecting cloud batch and serving systems[C]//Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data. ACM, 2011: 1101-1112.
[10] Hadoop[EB/OL]. [2019-06-09]. https://hadoop.apache.org/
[11] Cassandra[EB/OL]. [2019-06-09]. http://cassandra.apache.org/.
[12] Cedar[EB/OL]. [2019-06-09]. https://github.com/daseECNU/Cedar/.
[13] AZQUETA-ALZÚAZ A, PATIÑO-MARTINEZ M, BRONDINO I, et al. Massive data load on distributed database systems over HBase[C]//Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. IEEE, 2017: 776-779.
[14] DEAN J, GHEMAWAT S. MapReduce: Simplified data processing on large clusters[J]. Communications of the ACM, 2008, 51(1): 107-113. doi: 10.1145/1327452.1327492
[15] BARCLAY T, BARNES R, GRAY J, et al. Loading databases using dataflow parallelism[J]. ACM SIGMOD Record, 1994, 23(4): 72-83. doi: 10.1145/190627.190647
[16] RAMAKRISHNAN S R, SWART G, URMANOV A. Balancing reducer skew in MapReduce workloads using progressive sampling[C]//Proceedings of the 3rd ACM Symposium on Cloud Computing. ACM, 2012: Article No.16.