一、描述

          现有hbase的查询工具有很多如:Hive,Tez,Impala,Shark/Spark,Phoenix等。今天主要记录Phoenix。

          phoenix,中文译为“凤凰”,很美的名字。Phoenix是由saleforce.com开源的一个项目,后又捐给了Apache基金会。它相当于一个Java中间件,提供jdbc连接,操作hbase数据表。

          Phoenix官网上,对Phoenix讲解已经很详细了。如果英语好,可以看官网,更正式一些。

二、Phoenix安装

1.下载Phoenix

下载地址:http://mirror.bit.edu.cn/apache/phoenix/

phoenix与HBase版本对应关系

Phoenix 2.x - HBase 0.94.x

Phoenix 3.x - HBase 0.94.x

Phoenix 4.x - HBase 0.98.1+

我目前测试使用版本概况:

Hadoop2.7.5--HBase1.4.8

所以我可以用phoenix4.x。下载的压缩包为apache-phoenix-4.14.0-HBase-1.4-bin.tar.gz

wget http://mirror.bit.edu.cn/apache/phoenix/apache-phoenix-4.14.0-HBase-1.4/

 

2.移动压缩包

apache-phoenix-4.14.0-HBase-1.4-bin.tar.gz移动到hbase集群的其中一个服务器的一个目录下

我移动的目录为/usr/local/phoenix/

mv http://mirror.bit.edu.cn/apache/phoenix/apache-phoenix-4.14.0-HBase-1.4/ /usr/local/phoenix/

3.解压缩文件

tar –zxvf apache-phoenix-4.14.0-HBase-1.4-bin.tar.gz

重命名

mv apache-phoenix-4.14.0-HBase-1.4-bin phoenix-4.14.0

可看到有个phoenix-4.14.0/目录,里面包含了Phoenix的所有文件。

4.配置Phoenix

4.1、将phoenix-4.14.0/目录下 phoenix-4.14.0-HBase-1.4-server.jar、phoenix-core-4.14.0-HBase-1.4.jar、phoenix-4.14.0-HBase-1.4-client.jar拷贝到各个 HBase 的 lib目录中(每个节点都要放) 。

4.2、重启hbase集群,使Phoenix的jar包生效。

4.3、将hbase的配置文件hbase-site.xml 放到phoenix-4.14.0/bin/下,替换Phoenix原来的  配置文件。

5.修改权限

切换到phoenix-4.14.0/bin下,修改psql.pysqlline.py的权限为777

命令:chmod 777 文件名

6.验证是否成功

6.1、在phoenix-4.14.0/bin/下输入命令: ./sqlline.py localhost

如果看到如下界面表示启动成功。

6.2、输入!tables,查看都有哪些表。红框部分是我建的表,其他为Phoenix系统表,系统表中维护了用户表的元数据信息。

6.3、退出Phoenix。输入!exit命令、或者!quit命令

三、Phoenix使用

1、建表(在phoenix-4.14.0/bin/目录下)

./psql.py localhost:2181 ../examples/WEB_STAT.sql       

其中../examples/WEB_STAT.sql是建表的sql语句(phoenix自带的案例)--里面的语句如下:

CREATE TABLE IF NOT EXISTS WEB_STAT (

  HOST CHAR(2) NOT NULL,

  DOMAIN VARCHAR NOT NULL,

  FEATURE VARCHAR NOT NULL,

  DATE DATE NOT NULL,

  USAGE.CORE BIGINT,--usage指定列族名

  USAGE.DB BIGINT,--usage指定列族名

  STATS.ACTIVE_VISITOR INTEGER

  CONSTRAINT PK PRIMARY KEY (HOST, DOMAIN, FEATURE, DATE)--指定主键

);

2、导入数据

命令:./psql.py -t WEB_STAT localhost:2181 ../examples/WEB_STAT.csv

PS:其中 -t 后面是表名, ../examples/web_stat.csv 是csv数据(注意数据的分隔符需要是逗号)。

3、查询数据

首先使用sqlline查看(截图为部分列的数据),查询表名不区分大小写

语句:select * from web_stat;

查询2、查询记录总条数

语句:select count(1) from web_stat;

查询3、查询结果分组排序

语句:select domain,count(1) as num from web_stat group by domain order by num desc;

查询4、求平均值

语句:select avg(core) from web_stat;

查询5、多字段分组,排序,别名。

语句:select domain,count(1) as num,avg(core) as core,avg(db) as db from web_stat group by domain order by num desc;

查询6、查询日期类型字段

语句:select host,domain,date from web_stat where TO_CHAR(date)='2013-01-15 07:09:01.000';

查询7、字符串,日期类型转换

语句:select TO_DATE('20131125','yyyyMMdd') from web_stat;

Ps:输入的日期字符串会被转换为hbase表date的日期类型。

总结:Phoenix还支持了很多函数和sql语法,在这里不再一一列举。


四、Phoenix基本shell命令

PS:以下,可能有部分命令在Phoenix更高版本中已失效,改为其他命令代替,请注意。

0: jdbc:phoenix:localhost> help
!set       Set a sqlline variable
!script  Start saving a script to a file
!scan  Scan for installed JDBC drivers
!saveSave the current variabes and aliases
!run   Run a script from the specified file
!rollback  Roll back the current transaction (if autocommit is off)
!rehash Fetch table and column names for command completion
!record     Record all output to the specified file
!reconnect  Reconnect to the database
!quit  /!exit  Exits the program 
!properties Connect to the database specified in the properties file(s)
!procedures List all the procedures
!primarykeys List all the primary keys for the specified table
!outputformat  Set the output format for displaying results  (table,vertical,csv,tsv,xmlattrs,xmlelements)
!nativesql Show the native SQL for the specified statement
!metadata  Obtain metadata information
!manual   Display the SQLLine manual
!list List the current connections
!isolationSet the transaction isolation for this connection
!indexes List all the indexes for the specified table
!importedkeysList all the imported keys for the specified table
!historyDisplay the command history
!help Print a summary of command usage
!goSelect the current connection
!exportedkeysList all the exported keys for the specified table
!dropall Drop all tables in the current database
!describeDescribe a table
!dbinfo  Give metadata information about the database
!connect   Open a new connection to the database.
!commit  Commit the current transaction (if autocommit is off)
!columnsList all the columns for the specified table
!closeall Close all current open connections
!closeClose the current connection to the database
!callExecute a callable statement
!brief Set verbose mode off
!batchStart or execute a batch of statements
!autocommitSet autocommit mode on or off
!all Execute the specified SQL against all the current connections

五、用Phoenix Java api操作HBase

开发环境准备:idea、jdk1.8、window8、hadoop2.7.5、hbase-1.4.8、phoenix4.14.0

1.从集群拷贝以下文件core-site.xml、hbase-site.xml、hdfs-site.xml文件放到工程resources

2.在pom.xml中添加依赖(根据自己的版本添加相应的依赖)

<dependencies>
    <!--https://mvnrepository.com/artifact/org.apache.phoenix/phoenix-core -->
    <dependency>
        <groupId>org.apache.phoenix</groupId>
        <artifactId>phoenix-core</artifactId>
        <version>4.14.0-HBase-1.4</version>
    </dependency>

</dependencies>

4.在客户端C:\Windows\System32\drivers\etc\hosts文件中加入集群的hostname和IP

5.工程截图

6.工程代码

package com.phoenix.day01;

import java.sql.*;

public class Phoenix_Test {
    private static Connection conn=null;
    private static Statement statement = null;
    static {
        try {
            //phoenix4.14.0用下面的驱动对应hbase1.4.+
            Class.forName("org.apache.phoenix.jdbc.PhoenixDriver");
            //这里配置zookeeper的地址,可单个,也可多个。可以是域名或者ip
            String url = "jdbc:phoenix:master2:2181/hbase";
            //String url = "jdbc:phoenix:41.byzoro.com,42.byzoro.com,43.byzoro.com:2181";
            conn = DriverManager.getConnection(url);
            statement = conn.createStatement();
        } catch (Exception e) {
            e.printStackTrace();
        }


    }

    /**
     * 测试类
     * @param args
     * @throws SQLException
     */
    public static void main(String[] args) throws SQLException {
        testRead();
        testReadAll();
        testupsertAll(1000000);
        testUpsert();
        testCreateTable("student");
        testDelete("student");
        testSelect("student");
    }


    /**
     * 扫描全表
     * @param tableName
     * @throws SQLException
     */
    private static void testSelect(String tableName) throws SQLException {
        String sql="select * from "+tableName+"";
        statement.executeUpdate(sql);
        conn.commit();

        conn.close();
        statement.close();
    }

    /**
     * 单挑添加数据
     * @throws SQLException
     */
    private static void testUpsert() throws SQLException {
        String sql1="upsert into test_phoenix_api values(1,'test1')";
        String sql2="upsert into test_phoenix_api values(2,'test2')";
        String sql3="upsert into test_phoenix_api values(3,'test3')";
        statement.executeUpdate(sql1);
        statement.executeUpdate(sql2);
        statement.executeUpdate(sql3);
        conn.commit();
        System.out.println("数据已插入");

        statement.close();
        conn.close();
    }

    /**
     * 删除数据
     * @param tableName
     * @throws SQLException
     */
    private static void testDelete(String tableName) throws SQLException {
        String sql="delete from "+tableName+" where mykey = 1";
        statement.executeUpdate(sql);
        conn.commit();

        conn.close();
        statement.close();
    }

    /**
     * c创建表
     * @param tableName
     * @throws SQLException
     */
    private static void testCreateTable(String tableName) throws SQLException {
        String sql="create table "+tableName+"(mykey integer not null primary key ,mycolumn varchar )";
        statement.executeUpdate(sql);
        conn.commit();

        conn.close();
        statement.close();
    }

    /**
     * 批量插入
     * @param count
     * @throws SQLException
     */
    private static void testupsertAll(int count) throws SQLException {

        for (int i = 0; i < count; i++) {
            String sql ="upsert into STUDENT_CON values("+i+",'lisi',12)";
            statement.executeUpdate(sql);
            conn.commit();
        }
        System.out.println(count+"条数据已插入完毕");
        conn.close();
        statement.close();

    }

    /**
     * 使用phoenix提供的api操作hbase中读取数据
     */
    private static void testReadAll() throws SQLException {
//String sql = "select count(1) as num from web_stat";
        String sql = "select *  from web_stat where core = 35";
        long time = System.currentTimeMillis();
        ResultSet rs = statement.executeQuery(sql);
        while (rs.next()) {
            //获取core字段值
            int core = rs.getInt("core");
            //获取core字段值
            String host = rs.getString("host");
            //获取domain字段值
            String domain = rs.getString("domain");
            //获取feature字段值
            String feature = rs.getString("feature");
            //获取date字段值,数据库中字段为Date类型,这里代码会自动转化为string类型
            String date = rs.getString("date");
            //获取db字段值
            String db = rs.getString("db");
            System.out.println("host:"+host+"\tdomain:"+domain+"\tfeature:"+feature+"\tdate:"+date+"\tcore:" + core+"\tdb:"+db);
        }
        long timeUsed = System.currentTimeMillis() - time;
        System.out.println("time " + timeUsed + "mm");
        //关闭连接
        rs.close();
        statement.close();
        conn.close();
    }

    /**
     * 使用phoenix提供的api操作hbase读取数据
     */
    private static void testRead() throws SQLException {
        //Statement statement = conn.createStatement();
        String sql = "select count(1) as num from web_stat";
        long time = System.currentTimeMillis();
        ResultSet rs = statement.executeQuery(sql);
        while (rs.next()) {
            int count = rs.getInt("num");
            System.out.println("row count is " + count);
        }
        long timeUsed = System.currentTimeMillis() - time;
        System.out.println("time " + timeUsed + "mm");
        //关闭连接
        rs.close();
        statement.close();
        conn.close();
    }
}

Logo

旨在为数千万中国开发者提供一个无缝且高效的云端环境,以支持学习、使用和贡献开源项目。

更多推荐