Activity
Mon
Wed
Fri
Sun
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
What is this?
Less
More

Memberships

Data Alchemy

Public • 19.7k • Free

493 contributions to Data Alchemy
Automatically Boost your RAG Knowledgebase with Images | Free plug-n-play script to do so on auto-pilot
Hey everyone! A few days ago, I posted an article about a Python script I developed that helped me seamlessly transcribe and integrate 3,000+ visual slides full of crucial information into my RAG Knowledgebase—completely on autopilot. Now, I’ve given the script a full makeover to make it as easy as possible to plug-and-play! The article took off on Medium, so I knew I had to bring it here for you all to benefit: [ARTICLE LINK + SCRIPT LINK & FREE PROMPT TEMPLATES] [LinkedIn Post link in case you prefer it] This all started because I was developing a RAG chatbot for a client and... I ran into a pretty big problem. All the info he provided was in slides full of text & images! I had to figure out a way to implement these slides into the Knowledgebase (and it had to work). So I got to work and developed a script to do so on autopilot. It leverages the vision capabilities of the latest LLM models to transcribe the slides accurately—OCR just couldn’t cut it because it ignores the layout and misses out on crucial images. But that’s not all! The script is loaded with extra features: - SUMMARIZING & VECTORIZING: The script doesn’t just transcribe; it summarizes key concepts and creates vectors to ensure your Knowledgebase captures everything. These vectors are essential for data integration. - FOLDER PROCESSING: It processes every subfolder in your directory, so no image gets left behind. Perfect for managing large datasets. - SMART FILE NAMING: The script updates transcription filenames with vector counts, so you always know where you stand. - MERGING TRANSCRIPTIONS: You can merge all transcriptions into a single file—whether by folder or into one master file—keeping your data organized and accessible. - VECTOR COUNTING: Get a quick snapshot of your data volume with vector counts for each main folder—great for ensuring completeness. - VECTOR UPLOADING: Finally, it uploads all vectors to the Qdrant vector store (but you can switch to another provider with a simple code tweak).
9
9
New comment 17d ago
Automatically Boost your RAG Knowledgebase with Images | Free plug-n-play script to do so on auto-pilot
1 like • 17d
@Marcos Santiago Cool, thanks for sharing!
New
Hi everyone my name is Jorge Lopez I'm actually new here in this community
18
36
New comment 12d ago
2 likes • 17d
@Jorge Lopez Welcome!
Job referral
Hi everyone, I’m currently searching for new job opportunities and would greatly appreciate any leads or advice. I’m skilled in Python and have hands-on experience with technologies like Flask, Django REST Framework, and MySQL. I’m also familiar with Generative AI and am excited about roles that involve these technologies. If you know of any openings or have suggestions on where to look, please let me know! Thanks in advance for your support!
5
2
New comment 17d ago
0 likes • 17d
Hi Kalyani, share your LinkedIn and GitHub with us so we can befriend you and you can this way expand your network 😉. Also if you are searching for opportunities, try onsite opportunities there is way less way competition than remote.
Confused About the AI Job Landscape?
Data Analysis & Business Intelligence: - Positions: Data Analyst, BI Analyst, Business Analyst, Marketing Analyst, among others. - Primary Tasks: Analyzing data, creating strategies, and enhancing performance. Data Science & Machine Learning: - Positions: Data Scientist, Machine Learning Engineer, Deep Learning Engineer, Predictive Modeler, etc. - Primary Tasks: Creating algorithms, building predictive models, and deriving insights from data. AI & Robotics: - Positions: AI Developer, AI Research Scientist, Robotics Engineer, Autonomous Systems Engineer, and similar roles. - Primary Tasks: Developing AI applications, designing robotic systems, and pushing the boundaries of AI technology. Data Engineering & Management: - Positions: Data Engineer, Data Architect, Database Administrator, Data Governance Manager, and more. - Primary Tasks: Constructing data infrastructure, maintaining data quality, and ensuring data security. Research and Development: - Positions: AI Research Scientist, Computational Linguist, Quantum Machine Learning Researcher, and other R&D roles. - Primary Tasks: Engaging in AI research, creating computational models, and investigating cutting-edge technologies
11
3
New comment 17d ago
Confused About the AI Job Landscape?
1 like • 17d
Cool, thanks for sharing!
Mr. Beast Inspiration
I've been watching some Mr. Beast interviews lately and I find the fact that he used analytics to become the most subscribed youtuber of all time fascinating. One thing he mentioned in his backstory; was that he found 4 or 5 other people that wanted to be youtuber early in his career and jumped on calls with them every day for long periods of time and they shared everything they learned. In his words this 5X'd his learning speed. I would like to do the same for Data Science / AI freelancing. I am looking for ~4 other people to jump on calls with ~every day to discuss Data Science / AI Freelancing. I love this community but I want to do what Mr. Beast did as well. So leave a comment or message me if you are interested. This is not an attempt to sell anything or build any kind of funnel so I hope it's not against the rules. Thank you for reading.
15
23
New comment 9d ago
3 likes • 25d
Hi @Benjamin Jazayeri, I think your idea is great!
1 like • 17d
@Benjamin Jazayeri I just entered in the discord group 😉.
1-10 of 493
Ana Crosatto Thomsen
7
5,766points to level up
@ana-crosatto-thomsen
Passionate about data science, exploring the frontiers of Data and AI. Dedicated to crafting innovation, one line of code at a time!🌟

Active 10h ago
Joined Sep 11, 2023
INFJ
Brazil
powered by