Ankur Dave
YOU?
Author Swipe
View article: In-Memory Indexed Caching for Distributed Data Processing
In-Memory Indexed Caching for Distributed Data Processing Open
Powerful abstractions such as dataframes are only as efficient as their underlying runtime system. The de-facto distributed data processing framework, Apache Spark, is poorly suited for the modern cloud-based data-science workloads due to …
View article: Oblivious coopetitive analytics using hardware enclaves
Oblivious coopetitive analytics using hardware enclaves Open
Coopetitive analytics refers to cooperation among competing parties to run queries over their joint data. Regulatory, business, and liability concerns prevent these organizations from sharing their sensitive data in plaintext.
View article: [Demo] Low-latency Spark Queries on Updatable Data
[Demo] Low-latency Spark Queries on Updatable Data Open
As data science gets deployed more and more into operational applications, it becomes important for data science frameworks to be able to perform computations in interactive, sub-second time. Indexing and caching are two key techniques tha…