GSoC/GCI Archive
Google Summer of Code 2014 Twitter

Use zero-copy read path in new Hadoop APIs

by Sunyu for Twitter

The idea is to use new Hadoop API to avoid unnecessary bytes copy in the reading path of Parquet. The idea will bring a performance gain in the client when scanning the file. To work on older version of hadoop, an abstract layer will implemented.