DataTalksClub
Open-Source Spotlight - Ibis - Chloe He
Ibis: the portable Python data frame library
Timecodes:
00:00 Intro
00:56 What does it mean for Ibis to be a portable dataframes library?
02:40 Demo: analyzing IMDB data using Ibis
4:21 Connect to DuckDB from Ibis
7:14 Data cleaning using Ibis
10:26 Interspersing plain SQL with Ibis methods
13:30 Chaining queries to find the top 10 best movies by rating
18:31 Execute the same expression in Postgres!
20:12 Can I execute some transformation in DuckDB and write the results to Postgres?
22:06 Ibis streaming
24:18 How to contribute to Ibis?
26:44 Zulip community
27:51 What’s on the roadmap for Ibis?
29:57 Advice: be more open-minded and step out of your comfort zone
Links:
- GitHub repo: github.com/ibis-project/ibis
- Documentation: ibis-project.org/
- Zulip community: ibis-project.zulipchat.com/
- LinkedIn: www.linkedin.com/company/ibis-project/
- Twitter/X: x.com/IbisData
Free LLM course: github.com/DataTalksClub/llm-zoomcamp
Join DataTalks.Club: datatalks.club/slack.html
Our events: datatalks.club/events.html
Views: 495

Videos

Open-Source Spotlight - JuiceFS - Brent Bai
398 views · 1 month ago
JuiceFS is a high-performance, cloud-native, distributed file system Timecodes: 00:00 Intro and a few words about Brent 01:23 Introduction of JuiceFS 03:23 JuiceFS architecture 04:47 Demo resources walk through 06:25 Install JuiceFS 08:52 Create a JuiceFS file system 10:55 Mount a JuiceFS file system 12:42 Performance benchmark 18:55 JuiceFS CSI driver for kubernetes 24:33 Shared file system be...
LLM Zoomcamp 1.1 - Introduction to LLM and RAG
10K views · 1 month ago
Welcome to the first module of our course, LLM Zoomcamp! We cover the applications of LLM, focusing on RAG: retrieval augmented generation. Throughout the course, we will build a Q&A system using the FAQ data from our courses. We don't cover the theory behind LLMs, but we will learn how to utilize them effectively. Timecodes: 00:00 Introduction to LLM Zoomcamp 04:03 Understanding LLMs 09:15 Exp...
Open-Source Spotlight - FiftyOne - Jacob Marks
386 views · 2 months ago
Open-Source Spotlight is our series where we're discovering open-source tools. This time, we discussed FiftyOne, an open-source tool for building high-quality datasets and computer vision models. Timecodes: 00:00 Intro and What is FiftyOne 01:18 Running FiftyOne in a notebook 04:49 Navigating the FiftyOne App 09:20 Running Zero-Shot Models 11:04 Filtering by Image Brightness 11:02 Loading a Vid...
Project of the week: DIY Data Version Control (DVC)
619 views · 2 months ago
Open-Source Spotlight - Fluvio - Debadyuti Roy Chowdhury
627 views · 5 months ago
Open-Source Spotlight - Encord Active - Frederik Hvilshøj
400 views · 5 months ago
Open-Source Spotlight - Timeplus Proton - Jove Zhong
497 views · 7 months ago
Open-Source Spotlight - SuperDuperDB - Duncan Blythe
567 views · 7 months ago
Open-Source Spotlight - CrateDB - Karyn Azevedo
653 views · 8 months ago
Open-Source Spotlight - DuckDB - Gabor Szarnyas
1.2K views · 9 months ago
Open-Source Spotlight - CNDI - Matthew Johnston
340 views · 9 months ago
Open-Source Spotlight - Determined - Isha Ghodgaonkar
544 views · 9 months ago
Open-Source Spotlight - dlt - Alena Astrakhantseva
2.2K views · 10 months ago
Open-Source Spotlight - LLM App for real-time data - Bobur Umurzokov
1.9K views · 10 months ago
Open-Source Spotlight - Dolphin Scheduler - Eric Gao
763 views · 10 months ago
Open-Source Spotlight - Clippinator - Lev Chizhov and Sergei Bogdanov
557 views · 10 months ago
Open-Source Spotlight - Titan Takeoff - Fergus Finn
407 views · 11 months ago
Open-Source Spotlight - Autolabel - Rishabh Bhargava
687 views · 11 months ago
The Good, the Bad and the Ugly of GPT - Sandra Kublik
627 views · 1 year ago
Open-Source Spotlight - Dozer - Abhishek Mishra
666 views · 1 year ago
Open-Source Spotlight - Alibi Detect - Ashley Scillitoe
760 views · 1 year ago
Open-Source Spotlight - dstack - Andrey Cheptsov
386 views · 1 year ago
Open-Source Spotlight - Metaflow - Hugo Bowne-Anderson
814 views · 1 year ago
Open-Source Spotlight - Quix Streams - Tomas Neubauer
378 views · 1 year ago
Open-Source Spotlight - Dash - Adam Schroeder
657 views · 1 year ago
Open-Source Spotlight - YOLO-NAS - Harpreet Sahota
564 views · 1 year ago
Open-Source Spotlight - JupySQL - Eduardo Blancas
694 views · 1 year ago
Open-Source Spotlight - Hamilton - Stefan Krawczyk
437 views · 1 year ago
Open-Source Spotlight - Phoenix - Xander Song
765 views · 1 year ago

COMMENTS

  • @prozaclink
    @prozaclink 9 hours ago

    Is Docker mandatory to run Elasticsearch?

  • @olayinkaadegboye4556
    @olayinkaadegboye4556 1 day ago

    Hello, what's the role of a data engineer?

  • @vindolanda6974
    @vindolanda6974 4 days ago

    Wow, a content-less presentation. The topics list was good, the list of three sample questions was ok. But then didn't actually have any content about system design after that. Reminds me of when I was a consultant and didn't have time to prep for a presentation.

  • @jeromeeusebius
    @jeromeeusebius 9 days ago

    Thanks Akela Drissner for the workshop. Very informative. As Alexey said, people shouldn't be overwhelmed by the details. Just take some time and go over the notebook again. Good thing is, it has all the details you will need for adaptation to another use case.

  • @konutek7716
    @konutek7716 14 days ago

    Thank you for the workshop

  • @garlicbreaddotcom
    @garlicbreaddotcom 18 days ago

    Awesome workshop, fortunately greatly improved by the fabulous sound quality from the mic of Akela Drissner

  • @nymishareddy09
    @nymishareddy09 19 days ago

    Thank you! Exactly what I was looking for

  • @ClimateDS
    @ClimateDS 23 days ago

    Great workshop, which is unfortunately a bit diminished by the low sound quality from the mic of Akela Drissner :/

  • @Buy_YT_Views.610
    @Buy_YT_Views.610 27 days ago

    The cinematography is pure magic.