My academic research focuses on mathematical foundations and algorithms dealing with large amounts of data. Topics include streaming algorithms, numerical linear algebra, machine learning, fast dimensionality reduction, vector search, clustering, and data mining. I taught these topics at Tel Aviv university and at Princeton.
I received my B.Sc in Physics and Computer Science from Tel Aviv University and my Ph.D. in Computer Science from Yale University. After that, I was a Postdoctoral fellow at Yale in the Program in Applied Mathematics. I taught Advanced CS courses at Tel Aviv University and at Princeton.
I served on academic review committees for conferences and journals including NeurIPS, SIGIR, AISTATS, ICML, KDD, WSDM, WWW, COLT, ICML, ESA, SODA, and FOCS.
Edo Liberty, Alexandr Andoni, Eldar Kleiner
tl;dr: This new paper nearly resolves a critical step in KV-caches compression, allowing model serving of larger contexts or more sessions on the same infrastructure. It proves any KV cache (K,V) contains a subset (K',V') of size roughly O(sqrt(d)e^(r)/eps)) such that ||Attn(q,K,V) - Attn(q,K',V')|| < eps for all queries ||q|| < r.
Edo Liberty, Zohar Karnin, Bing Xiang, Laurence Rouesnel, Baris Coskun, Ramesh Nallapati, Julio Delgado Mangas, Amir Sadoughi, Yury Astashonok, Piali Das, Can Balioglu, Saswata Chakravarty, Madhav Jha, Philip Gautier, Tim Januschowski, Valentin Flunkert, David Arpin, and Alex Smola.
SIGMOD 2020
tl;dr: The culmination of more than two years of work, this paper describes the algorithms and distributed architecture behind Amazon SageMaker's elastic ML algorithms.
Zohar Karnin, Kevin Lang, Edo Liberty
FOCS 2016
tl;dr: This paper describes the KLL algorithm. It resolves one of the longest standing and basic problems in the streaming algorithms literature. Namely, optimally approximating ranks and quantiles in streaming data. It is implemented in many database systems including BigQuery and Apache DataSketches. [slides] [Python code]
Graham Cormode, Zohar Karnin, Edo Liberty, Justin Thaler, Pavel Veselý
PODS 2021, Best paper award - 2022 ACM SIGMOD Research Highlight Award
tl;dr: This (finally) solves a problem I wanted to solve for years. Namely, how to efficiently sketch quantiles with relative errors. This is critical for large scale performance monitoring, for example.
Edo Liberty
KDD 2013, Best paper award
tl:dr: This paper introduced frequent-directions (FD), an incredibly simple, practically efficient, and theoretically optimal algorithm for approximating the covariance of vector streams. The followup work Even Simpler Deterministic Matrix Sketching later simplified the proof to a single line. Frequent-directions is implemented in many AI platforms, typically for approximating the Hessian. It is also widely taught in CS classes. [slides], [talk], [bib].
See Frequent-Directions in python and experiments repo that Mina Ghashami and I made public.
Nir Ailon, Zohar Karnin, Edo Liberty, Yoelle Maarek
Best paper award TechPulse 2012, and WSDM 2013
tl:dr the paper shows how to use sketches to find causality relations between billions of events using trillions of observations. [bib]
Nir Ailon, Edo Liberty
SODA 2011, Best paper award
tl:dr The main result of my PhD work on fast dimension reduction. Specifically, matching the optimal target dimension and running time of the Johnson-Lindenstrauss lemma with fast random Hadamard based projection algorithms. Random Hadamard transformations are, by now, a workhorse of vector databases, quantization, and model compression. [bib]
Zohar Karnin and Edo Liberty
COLT 2019
tl;dr: This ML-theory paper shows that many types of machine learning models have much smaller coresets than those previously known. As a special case of the general result, it resolves the open problem regarding the coreset complexity of gaussian density estimation.
Edo Liberty, Alexandr Andoni, Eldar Kleiner
Arxiv 2026
Edo Liberty
Arxiv 2022
Aditya Krishnan, Edo Liberty
Arxiv 2021
Graham Cormode, Zohar Karnin, Edo Liberty, Justin Thaler, Pavel Veselý
Best paper at PODS 2021
Pigi Kouki, Ilias Fountalis, Nikolaos Vasiloglou, Xiquan Cui, Edo Liberty, Khalifeh Al Jadda
RecSys 2020
Edo Liberty, Zohar Karnin, Bing Xiang, Laurence Rouesnel, Baris Coskun, Ramesh Nallapati, Julio Delgado Mangas, Amir Sadoughi, Yury Astashonok, Piali Das, Can Balioglu, Saswata Chakravarty, Madhav Jha, Philip Gautier, Tim Januschowski, Valentin Flunkert, David Arpin, and Alex Smola.
SIGMOD 2020
Nikita Ivkin, Edo Liberty, Kevin Lang, Zohar Karnin, Vladimir Braverman
Sensors 2022
Zohar Karnin and Edo Liberty
COLT 2019
Nicholas Ryder, Zohar Karnin, and Edo Liberty
ARXIV 2019
Yu Bai, Yu-Xiang Wang, Edo Liberty
ICLR 2019
Daniel Anderson, Pryce Bevan, Kevin Lang, Edo Liberty, Lee Rhodes, Justin Thaler
IMC 2017
Edo Liberty, Maxim Sviridenko, Approx 2017
Zohar Karnin, Kevin Lang, Edo Liberty
FOCS 2016
Edo Liberty
Arxiv 2016
Professor Wenjian Yu of Tsinghua University pointed out that a the square was omitted from (1+eps) in equation 2. The proof is still correct after a straight forward correction. This will be corrected in the next version.
Mina Ghashami, Edo Liberty, Jeff M. Phillips
KDD 2016
Kevin Lang, Edo Liberty, Konstantin Shmakov
ICML 2016 [slides]
Edo Liberty, Michael Mitzenmacher, Justin Thaler, Jonathan Ullman
PODS 2016
Edo Liberty, Ram Sriharsha, Maxim Sviridenko
ALENEX 2016 [bib]
Zohar Karnin, Edo Liberty
COLT 2015
(see also 5 minute video lecture)
Christos Boutsidis, Dan Garber, Zohar Karnin, Edo Liberty
SODA 2014 [bib]
Dimitris Achlioptas, Zohar Karnin, Edo Liberty
NIPS 2013 [bib]
Edo Liberty (see slides and experimental results in json format)
Also, here is talk I gave at the Simons Institute about this.
Best paper at KDD 2013 [bib]
See also frequent direction git repo by Mina Ghashami and myself.
Nir Ailon, Zohar Karnin, Edo Liberty, Yoelle Maarek
Best paper at TechPulse 2012 and WSDM 2013 [bib]
Zohar Karnin, Edo Liberty, Shachar Lovett, Roy Schwartz, and Omri Weinstein
Liran Katzir, Edo Liberty, and Oren Somekh
WWW 2012 [bib]
Nir Ailon, Edo Liberty
Best paper at SODA 2011 [bib]
Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen
Yehuda Koren, Edo Liberty,Yoelle Maarek, and Roman Sandler
KDD 2011 [bib]
Liran Katzir, Edo Liberty, and Oren Somekh
WWW 2011 [bib]
Gal Lavee, Ronny Lempel, Edo Liberty, and Oren Somekh
WWW 2011 [bib]
Nir Ailon, Edo Liberty
ICALP 2009 [bib]
Edo Liberty, Nir Ailon, Amit Singer
RANDOM 2008 [bib]
Nir Ailon, Edo Liberty
SODA 2008 [bib]
Sebastian Bruch, Franco Maria Nardini, Amir Ingber, Edo Liberty
TOIS - ACM Transactions on Information Systems 2023
Mina Ghashami, Edo Liberty, Jeff M. Phillips, David P. Woodruff
Liran Katzir, Edo Liberty, Oren Somekh, Ioana A. Cosma
Journal of Internet Mathematics [bib]
Nir Ailon, Edo Liberty
Transactions on Algorithms [bib]
Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, and Anke van Zuylen
SIAM Journal on Computing [bib]
Zohar Karnin, Edo Liberty, Shachar Lovett, Roy Schwartz and Omri Weinstein
JMLR 2012 (Journal of Machine Learning Research) [bib]
Edo Liberty, Nir Ailon, Amit Singer
DCG 2010 (Discrete and Computational Geometry) [bib]
Edo Liberty, Steven Zucker
IPL 2009 (Information Processing Letters) [bib]
Nir Ailon, Edo Liberty
DCG 2008 (Discrete and Computational Geometry) [bib]
Edo Liberty, Franco Woolfe, Vladimir Rokhlin, and Mark Tygert
ACHA 2008 (Applied and Computational Harmonic Analysis) [bib]
Edo Liberty, Franco Woolfe, Per-Gunnar Martinsson, Vladimir Rokhlin, and Mark Tygert.
PNAS 2007 (Proceedings of the National Academy of Sciences) [bib]
Roni Ilan, Edo Liberty, Shahar Even-Dar Mandel, and Ron Lifshitz.
Ferroelectrics 2004.
PhD Thesis. See also Talk slides