Publications
- Jayjeet Chakraborty, Matthieu Dorier, Philip Carns, Robert Ross, Carlos Maltzahn, Heiner Litz. Thallus: An RDMA-based Columnar Data Transport Protocol. HotInfra 2024, Austin, TX, USA. [paper] [arxiv]
- Andrew Lamb, Yijie Shen, Daniel Heres, Jayjeet Chakraborty, Mehmet Ozan Kabak, Liang-Chi Hsieh, Chao Sun. Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine. SIGMOD 2024, Santiago, Chile. [paper]
- Jayjeet Chakraborty, Ivo Jimenez, Sebastiaan Alvarez Rodriguez, Alexandru Uta, Jeff LeFevre and Carlos Maltzahn. Skyhook: Towards an Arrow-Native Storage System. 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid), Taormina, Italy, 2022. [paper]
- Sebastiaan Alvarez Rodriguez, Jayjeet Chakraborty, Aaron Chu, Ivo Jimenez, Jeff LeFevre, Carlos Maltzahn, Alexandru Uta. Zero-Cost, Arrow-Enabled Data Interface for Apache Spark. SCDM, 2021. [paper]
- Partha Kumbhakar, Abhirup Roy Karmakar, Gour P. Das, Jayjeet Chakraborty, Chandra Sekhar Tiwary and Pathik Kumbhakar. Reversible temperature-dependent photoluminescence in a semiconductor quantum dot for development of smartphone-based optical thermometer. Nanoscale, 2021. [paper]
- Jayjeet Chakraborty, Carlos Maltzahn, Ivo Jimenez. Enabling Seamless Execution of Computational and Data Science Workflows on HPC and Cloud with the Popper Container-Native Automation Engine. CANOPIE-HPC Workshop 2020, 12 November, 2020. [paper]
- Jayjeet Chakraborty, Ivo Jimenez, Carlos Maltzahn, Arshul Mansoori, Quincy Wofford. Popper 2.0: A Container-native Workflow Execution Engine For Testing Complex Applications and Validating Scientific Claims. 2020 Exascale Computing Project Annual Meeting, Houston, TX, February 3-7, 2020. [poster]
- Biswas, S., Chakraborty, J., Agarwal, A. and Kumbhakar, P. Gold nanostructures for the sensing of pH using a smartphone. RSC Advances, 2019. [paper]
Talks
- Jayjeet Chakraborty, Heiner Litz. Accelerating Billion-Scale ANNS on Modern Hardware. CRSS IAB Meeting Fall 2024, UC Santa Cruz. November 2024. [slides]
- Jayjeet Chakraborty. Thallus: An RDMA-based Columnar Data Transport Protocol. HotInfra 2024, Austin, TX, USA. [slides]
Jayjeet Chakraborty. Analyzing the Performance of Vector Databases. Invited talk at Broadcom, Palo Alto, CA. June, 2024. [slides]
Jayjeet Chakraborty, Heiner Litz. Towards Optimizing Search and Indexing in Vector Databases. CRSS IAB Meeting Spring 2024, UC Santa Cruz. May, 2024. [slides]
Jayjeet Chakraborty. Towards Faster Columnar Data Transport using RDMA. Computational I/O Stack Workshop 2023, UC Santa Cruz. August, 2023. [slides]
Jayjeet Chakraborty. Optimizing Data Access with Compute Offloading, Fast Hardware-Accelerated Data Transport, and Modern Query Languages. PyHEP.dev Workshop 2023, Princeton University, New Jersey. July, 2023. [slides]
Jayjeet Chakraborty. DOMA R/D and Analysis Grand Challenge. IRIS-HEP AGC Workshop 2023, University of Wisconsin-Madison. May, 2023. [slides]
Jayjeet Chakraborty. Open Source Contribution 101. CSE 110, University of California, Santa Cruz, Winter 2023. [slides]
Jayjeet Chakraborty, Carlos Maltzahn, Stephanie Lieggi. Hidden Gems: Enabling Open Source Communities & Building up Talent Pipelines Through Mentorship. FOSSY 2023, Portland, Oregon. July, 2023.
Jayjeet Chakraborty. SkyhookDM: Embedding Apache Arrow Inside Storage Systems. The Data Thread Conference 2022. [slides]
Jayjeet Chakraborty. Skyhook: Managing Columnar Data Within Storage. PyHEP 2022. [slides]
Jayjeet Chakraborty. SkyhookDM: An Arrow-Native Storage System. SNIA Storage Developer Conference (SDC) 2021. [slides]
Project Reports
Jayjeet Chakraborty. Yosemite: Towards Designing a File Format for Long-Term Archival Storage of Structured Datasets. Project Report, CSE290S - Archival Storage Systems, UC Santa Cruz, Spring 2023. [report]
Jayjeet Chakraborty, Nilesh Negi. Storing Streaming Key-Value Pairs in Aspen. Project Report, CSE293 - Stream Processing Systems, UC Santa Cruz, Winter 2023. [report]
Jayjeet Chakraborty. Quantifying CPU and Network savings in Computational Storage Systems. Project Report, CSE232 - Distributed Systems, UC Santa Cruz, Spring 2022. [report]
Jayjeet Chakraborty. Benchmarking DuckDB with Skyhook. Project Report, CSE215 - Data Management Systems, UC Santa Cruz, Winter 2022. [report]
Jayjeet Chakraborty, Nayan Sanjay Bhatia, Yash Rajesh Chabbria. Benchmarking the Flight Transport Protocol in Different Languages. Project Report, CSE210A - Programming Languages, UC Santa Cruz, Winter 2022. [report]