Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

Tune Studio is an end-to-end platform for developing applications using Large Language Models. So far, I haven't seen any other platform like this one. You can do everything here: 1. You can curate your data. 2. Use the playground to play with different models and try your ideas. 3....

39,101 görüntüleme • 2 yıl önce •via X (Twitter)

5 Yorum

Anshuman profil fotoğrafı
Anshuman2 yıl önce

🔥🔥🔥

Assist profil fotoğrafı
Assist2 yıl önce

Do you need a social media manager for your business or personal brand? We’re here to help! Send us a message to discuss a plan that works for you. We can manage and create content for any platform and any industry. Platforms including Facebook, Instagram, Twitter X, YouTube, TikTok, Pinterest, LinkedIn, and more! Learn more ➡️

dkg://Lyvn profil fotoğrafı
dkg://Lyvn2 yıl önce

is similar and opensource. Will give tune studio a try and see how they compare. Thanks

Derek Truesdell profil fotoğrafı
Derek Truesdell2 yıl önce

I tried to sign up but got "response 404 (backend NotFound), service rules for the path non-existent" when I tried to follow the process. That is not a great sign =)

Anton Ansalmar profil fotoğrafı
Anton Ansalmar2 yıl önce

👍

Benzer Videolar

99% of AI applications are cool-looking demos. Impressive, but don't get fooled by the hype. It takes a lot to build enterprise-grade products that deliver real value. I have at least three weekly conversations with companies that want to use a Large Language Model with their data. The demand is huge! Here is one idea about what you can do to help. The use cases that most of these companies want to solve are similar: They have an extensive knowledge base and want to build a simple application that uses that information to answer questions. In other words, they need help building Retrieval Augmented Generation (RAG) applications they can use in many different scenarios: 1. To train new employees 2. To help their support team 3. To search old meetings and documents 4. To help with their research However, building these systems is not straightforward. Yes, there's a lot of information online, but there aren't enough people who know how to create solutions that work. Here is the idea: Today, you can build an enterprise-grade RAG application without writing code. A couple of MIT PhDs with 10+ years of experience building AI applications created . It's a no-code platform for building applications using Large Language Models. They are partnering with me on this post. You can use Stack AI to create, test, and deploy an end-to-end production-ready AI system. It's SOC-2, HIPAA, and GDPR compliant and offers SSO, role management, access control, and on-premise deployments. Of course, you can use the platform with any LLM on the market now. It's the whole nine yards for building AI applications. Check them out here: 2023 was about models. 2024 is about the tools using these models to build production-ready applications. That's where I'd start.

Santiago

197,675 görüntüleme • 2 yıl önce

"Introducing Multimodal Llama 3.2": As promised two weeks ago, here's the short course on Meta's latest open model! This short course is created with Meta and taught by Amit Sangani, Director of AI Partner Engineering at Meta. Meta’s Llama family of models is leading the way in open models, allowing anyone to download, customize, fine-tune, or build new applications on top of them. Learn about the vision capabilities of the Llama 3.2, and use it for image classification, prompting, tokenization, tool-calling. You'll also learn about the open-source Llama stack, which gives building blocks for many different stages of the LLM application life cycle. In detail, you’ll: - Learn what are the features of Meta's four newest models, and when to use which Llama model. - Learn best practices for multimodal prompting, with applications to advanced image reasoning, illustrated by many examples: Understanding errors on a car dashboard, adding up the total of photographed restaurant receipts, grading written math homework. - Use different roles—system, user, assistant, ipython—in the Llama 3.1 and 3.2 models and the prompt format that identifies those roles. - Understand how Llama uses the tiktoken tokenizer, and how it has expanded to a 128k vocabulary size that improves encoding efficiency and multilingual support. - Learn how to prompt Llama to call built-in and custom tools (functions) with examples for web search and solving math equations. - Learn about Llama Stack, a standardized interface for common toolchain components like fine-tuning or synthetic data generation, useful for building agentic applications. By the end of this course, you’ll be equipped to build out new applications with the new Llama 3.2. Thank you to Ahmad Al-Dahle, Amit Sangani, and the whole AI at Meta team AI at Meta for all the hard work on Llama 3.2 — we’re excited to make these open models even more accessible to more developers with this new course! Please sign up here!

Andrew Ng

131,606 görüntüleme • 1 yıl önce

How can you solve complex tasks using a Large Language Model? Here is a 2-minute introduction to everything you need to know to 10x the quality of your results. Let's talk about three techniques, in order of complexity, starting with the easiest one: • In-Context Learning • Indexing + In-Context Learning • Fine-tuning In-Context Learning The team that trained GPT-3 found something they couldn't explain: You can condition a model using examples of how you want it to behave. I included an example prompt in the attached video. You can "teach" the model how you want it to interpret questions, select the correct answers, and format the results by giving a few examples. You can also give specific knowledge to the model that will be helpful when formulating answers. We call this approach "grounding the model." There's another example in the video. Indexing + In-Context Learning Unfortunately, there is a limit to how much data you can include in a prompt. We call this the "context size." One version of GPT-4 supports a context of approximately 6,000 words, while the other supports 25,000 words. Although this sounds like a lot, many applications need more than that. Imagine you wrote a book and want to build an application to answer any questions about your story. What happens if your book is longer than the context? That's where Indexing comes in. Using a model, you can turn every book passage into an embedding. These are vectors, numbers that "encode" the passage's text. You can then store these embeddings in a particular database that supports fast retrieval of these vectors. You can then turn any question into an embedding and search the database for the list of passages that are similar to that query. Instead of using the entire book to ask the model, you can now use the relevant passages as in-context information, effectively working around the context size limitation. Fine-tuning Fine-tuning can give you an extra boost to get reliable outputs from your LLM. It is, however, the most complex approach on the list. There are different approaches to fine-tuning a model with your data. A popular technique is to process your data with your LLM and use the outputs to train a new classifier that solves your specific task. Notice that here you aren't modifying the LLM. Instead, you are chaining it with your trained classifier. Another approach is to modify the parameters of the LLM using your data. Think of this as "rewiring" the model in a way that solves your particular task. The results and costs will vary depending on how many layers you want to fine-tune from the original model. Many companies think that fine-tuning is the solution to their problems. In my experience, many will benefit from exploring the other two approaches. I love explaining Machine Learning and Artificial Intelligence ideas. If you enjoy in-depth content like this, follow me Santiago so you don't miss what comes next.

Santiago

384,028 görüntüleme • 3 yıl önce

This is next-level smart: An open-source platform that evaluates your prompts and automatically refines them based on the results. ​ Of course, it feels obvious after you see it: ​ • You write a prompt • The system evaluates it across different scenarios • Based on the results, it refines it to improve results ​ I recorded a quick video to show you how it works. It's pretty cool stuff! ​ Here are some of the problems and best practices for teams building AI applications: ​ 1. Testing your prompts manually doesn't scale 2. Prompts should not be spread throughout the codebase 3. Non-technical people need easy access to your prompts 4. Prompts can always use a version history to track changes 5. Monitoring the performance of prompts overtime is critical ​ Evaluating the prompts is what keeps me up at night from this list. Of all the conversations I've had with companies and people building AI applications, this is the area that's causing the most pain. ​ Testing a prompt is difficult. Think about how you'd test the response of a model subjectively. What do you account for, "tone," "objectivity," "completeness," "creativity," "readability," etc.? ​ Last week, I met the developers behind Latitude, an open-source prompt engineering platform trying to solve all of these issues. You can try the platform in two ways: ​ • You can self-host the platform. Free and open-source. • If you want to try their online product, their free tier is huge. ​ Here is the link: ​ Thanks to the Latitude team for collaborating with me on this post, and congratulations on going live with their product!

Santiago

64,141 görüntüleme • 1 yıl önce