hadoop - HBase Shell将慢速放入几行表(独立模式)

标签 hadoop hbase

我有一个在独立模式下运行的Hbase 1.4.9,在ubuntu 18.04.1 LTS,4GB RAM和几乎空的表“求和”上运行

带有 PUT 的插入操作必须花费10秒或更长时间,并且有几次尝试以毫秒为单位。 扫描在大多数情况下也很慢。

在独立模式下这正常吗?还是有一些我需要调整的设置?

这是捕获并插入的内容:

hbase(main):002:0> put 'summations','row5','reading:kWh_d_0',30
0 row(s) in 20.4030 seconds

这是扫描
hbase(main):003:0> scan 'summations'
ROW                   COLUMN+CELL
 daab-D035016215-2019 column=reading:kWh_d_45900, timestamp=1549635169098, value
 0110                 =7362.332
 daab-D035016215-2019 column=reading:kWh_r_45900, timestamp=1549635169098, value
 0110                 =0
 row1                 column=reading:kWh_d_0, timestamp=1549633002928, value=500
 row2                 column=reading:kWh_d_0, timestamp=1549633101719, value=500
 row3                 column=reading:kWh_d_0, timestamp=1549633184885, value=500
 row5                 column=reading:kWh_d_0, timestamp=1549635435618, value=30
5 row(s) in 0.0490 seconds

这是表格的说明
hbase(main):004:0> describe 'summations'
Table summations is ENABLED
summations
COLUMN FAMILIES DESCRIPTION
{NAME => 'reading', BLOOMFILTER => 'ROW', VERSIONS => '1', IN_MEMORY => 'false',
 KEEP_DELETED_CELLS => 'FALSE', DATA_BLOCK_ENCODING => 'NONE', TTL => 'FOREVER',
 COMPRESSION => 'NONE', MIN_VERSIONS => '0', BLOCKCACHE => 'true', BLOCKSIZE =>
'65536', REPLICATION_SCOPE => '0'}
1 row(s) in 10.1530 seconds

这是我的hbase-site.xml文件
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<configuration>
<property>
    <name>hbase.rootdir</name>
    <value>file:///home/user/hbase</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.dataDir</name>
    <value>/home/user/zookeeper</value>
  </property>
  <property>
    <name>hbase.unsafe.stream.capability.enforce</name>
    <value>false</value>
    <description>
      Controls whether HBase will check for stream capabilities (hflush/hsync).

      Disable this if you intend to run on LocalFileSystem, denoted by a rootdir
      with the 'file://' scheme, but be mindful of the NOTE below.

      WARNING: Setting this to false blinds you to potential data loss and
      inconsistent system state in the event of process and/or node failures. If
      HBase is complaining of an inability to use hsync or hflush it's most
      likely not a false positive.
    </description>
  </property>
  <property>
          <name>hbase.zookeeper.property.clientPort</name>
          <value>62181</value>
        </property>
        <property>
                <name>hbase.zookeeper.quorum</name>
                <value>192.168.2.223</value>
        </property>

</configuration>

最佳答案

我尝试更改设置无济于事。
全新安装Hbase可解决此问题。

关于hadoop - HBase Shell将慢速放入几行表(独立模式),我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/54594743/

相关文章:

java - 为文件输入定义手动拆分算法

HBase Zookeeper 关闭连接

java - 基于HBase和Hadoop MapReduce的分布式multimap

hadoop - HBase中筛选查询的性能?

hadoop - 在hbase中自动生成row_key

hadoop - Hive 外部表与内部表命令

datetime - 将 Unix 纪元时间转换为扩展的 ISO8601

java - 为什么HBase RowKey、ColumnKey和value是二进制值(字节),而不是String?

scala - 星火笔记本 : How can I filter rows based on a column value where each column cell is an array of strings?

java - 没有集群的 Hadoop MapReduce - 这可能吗?