The difference between Hive and relational databases

Hive looks so similar to a relational database that people learning it often get the impression that Hive is a database. It is not. Hive is a client-side layer on top of Hadoop: the data is stored in HDFS at the bottom, and the execution engine is MapReduce, which runs on the Hadoop cluster. In other words, Hive is a client wrapper around Hadoop.
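
To make the "SQL on top of MapReduce" idea concrete, here is a minimal conceptual sketch in Python of how a query such as SELECT dept, COUNT(*) FROM employees GROUP BY dept can be expressed as map, shuffle, and reduce steps. The employees table, its columns, and all function names are hypothetical, and this is not Hive's actual query planner, only an illustration of the execution model.

```python
from collections import defaultdict

# Hypothetical rows of an "employees" table: (name, dept)
ROWS = [
    ("ann", "sales"),
    ("bob", "eng"),
    ("cat", "sales"),
    ("dan", "eng"),
    ("eve", "ops"),
]


def map_phase(rows):
    """Emit one (key, 1) pair per row, keyed by the GROUP BY column."""
    for _name, dept in rows:
        yield dept, 1


def shuffle_phase(pairs):
    """Collect all values that share a key, as the shuffle does
    between the map and reduce steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values of each key to produce COUNT(*) per group."""
    return {key: sum(values) for key, values in groups.items()}


def count_by_dept(rows):
    """Run the three phases in order on an in-memory table."""
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_dept(ROWS))
```

On a real cluster each phase runs in parallel on many machines; here everything runs in one process so the data flow is easy to follow.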

1. Data update

  • Hive: read-heavy and write-light; data is read often and rarely modified
  • MySQL: data usually needs to be modified frequently

2. Query latency

  • MySQL: queries usually complete within seconds
  • Hive: queries take longer:
    • A Hive query has no index to use and must scan the whole table, so latency is high
    • When Hive runs a query as a MapReduce job, there is a shuffle phase, and the shuffle writes intermediate data to disk, which adds more latency
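
The cost of having no index can be sketched by counting how many rows each approach has to examine. The example below is a rough illustration under stated assumptions: the table is a hypothetical in-memory list, and a Python dict stands in for a real database index (a real B-tree index would touch a few nodes rather than exactly one entry).

```python
# Hypothetical table: 100,000 rows of (id, value)
TABLE = [(i, f"value-{i}") for i in range(100_000)]


def full_scan(table, wanted_id):
    """Check every row, as a query without an index has to do.
    Returns the matching value and the number of rows examined."""
    examined = 0
    found = None
    for row_id, value in table:
        examined += 1
        if row_id == wanted_id:
            found = value
    return found, examined


def build_index(table):
    """Map each id to its row position (a stand-in for a database index)."""
    return {row_id: pos for pos, (row_id, _value) in enumerate(table)}


def index_lookup(table, index, wanted_id):
    """Jump straight to the row via the index.
    Returns the matching value and the number of rows examined."""
    pos = index.get(wanted_id)
    if pos is None:
        return None, 0
    return table[pos][1], 1


INDEX = build_index(TABLE)

if __name__ == "__main__":
    print(full_scan(TABLE, 42))            # examines every row
    print(index_lookup(TABLE, INDEX, 42))  # examines a single row
```

The full scan does work proportional to the table size for every query, which is why latency grows with the data when no index is available.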

3. Data size

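  • Hive: handles large data volumes
    • Hive data is stored in HDFS on a cluster, so capacity can be expanded horizontally by adding machines
  • MySQL: has a storage bottleneck
    • MySQL stores its data on the disk of the server it runs on, so capacity is limited by that machine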
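
As a back-of-the-envelope illustration of expanding a cluster by adding machines, the sketch below estimates usable capacity from the node count. The node counts and disk sizes are made-up numbers, the function name is hypothetical, and the replication factor of 3 is assumed here because it is the HDFS default; a real cluster also reserves space for other purposes.

```python
def usable_capacity_tb(nodes, disk_tb_per_node, replication=3):
    """Estimate usable storage: raw disk across all nodes divided by
    the number of copies kept of each block (assumed replication factor)."""
    return nodes * disk_tb_per_node / replication


if __name__ == "__main__":
    # Hypothetical cluster: 12 TB of disk per node
    print(usable_capacity_tb(3, 12))  # 3 nodes
    print(usable_capacity_tb(6, 12))  # doubling the nodes doubles the capacity
```

A single-server database has no equivalent knob: its capacity is bounded by what one machine can hold.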