Vector Podcast from Berlin Buzzwords’24: Sonam Pankaj, EmbedAnything

Dmitry Kan
1 min readSep 19, 2024

--

I’ve just released an episode with Sonam Pankaj. She works on EmbedAnything. We have recorded this episode at Berlin Buzzwords back in June, where I also got the chance to test my new audio recording gear (RØDE Wireless GO II).

EmbedAnything is an infrastructure layer, that allows you to embed anything (different text formats, but also other modalities, like audio), written in Rust for performance reasons. It can embed a pdf text 40x faster than in Python.

EmbedAnything sits on the same level as Encoders layer in the Vector Search Pyramid: https://medium.com/@dmitry-kan/neural-search-frameworks-a-head-to-head-comparison-976aa6662d20

We spoke about this project in detail, but also about metric learning, quality assurance and multimodality.

There are a bunch of show notes with different papers and projects — do check them out.

RSS: https://rss.com/podcasts/vector-podcast/1663042/

Spotify: https://open.spotify.com/episode/5pUWz19iWKHqUzNT0JQ9KL

Apple Podcasts: https://podcasts.apple.com/fi/podcast/berlin-buzzwords-2024-sonam-pankaj-embedanything/id1587568733?i=1000670040161

Patreon: https://www.patreon.com/posts/vector-podcast-112350470?utm_medium=clipboard_copy&utm_source=copyLink&utm_campaign=postshare_creator&utm_content=join_link

--

--

Dmitry Kan

Founder and host of Vector Podcast, tech team lead, software engineer, manager, but also: cat lover and cyclist. Host: https://www.youtube.com/c/VectorPodcast