commit | 26c3b5361db6b2a3816ec6818e1be4d0b1ce1b3d | [log] [tgz] |
---|---|---|
author | JavierJia <jianfeng.jia@gmail.com> | Fri Oct 23 13:49:32 2015 -0700 |
committer | Jianfeng Jia <jianfeng.jia@gmail.com> | Wed Oct 28 17:20:56 2015 -0700 |
tree | 2a0f2a4560b8d82d1ec5b3d90c218b030d4406ad | |
parent | 492b6fea99a4f6b90f674f46b1945f609cd1f5f1 [diff] |
ASTERIXDB-1102: VarSize Encoding to store length of String and ByteArray This patch is to change the encoding format that stores the length value of the variable length type (e.g. String, ByteArray) from fix-size encoding (2bytes) to variable-size encoding ( 1 to 5bytes) It will solve the issue 1102 to enable us to store a String that longer than 64K. Also for the common case of storing the short string ( <= 127), it will save one byte per string. Some important changes include: 1. Add one hyracks-util package to consolidate all the hyracks independent utility functions. It will reduce the chances of having duplicate utils in different packages. 2. Move parts of Asterix string functions down to Hyracks UTF8StringPointable object, which will benefit the other dependencies, such as VXQuery. Change-Id: I7e95df0f06984b784ebac2c84b97e56a50207d27 Reviewed-on: https://asterix-gerrit.ics.uci.edu/449 Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu> Reviewed-by: Taewoo Kim <wangsaeu@gmail.com> Reviewed-by: Jianfeng Jia <jianfeng.jia@gmail.com>