Mejores prácticas de indicaciones para utilizar Llama 2 Chat LLM a través de Amazon SageMaker JumpStart | Servicios web de Amazon

Llama 2 se encuentra a la vanguardia de la innovación en IA, incorporando un modelo de lenguaje autorregresivo avanzado desarrollado sobre una base transformadora sofisticada. Está diseñado para abordar una multitud de aplicaciones tanto en el ámbito comercial como en el de investigación, con el inglés como concentración lingüística principal. Los parámetros de su modelo varían desde unos impresionantes 7 mil millones hasta unos notables 70 mil millones. Llama 2 demuestra el potencial de los modelos de lenguaje grandes (LLM) a través de sus habilidades refinadas y su rendimiento ajustado con precisión.

Profundizando en la arquitectura de Llama 2, Meta revela que el ajuste fino del modelo combina el ajuste fino supervisado (SFT) con el aprendizaje reforzado asistido por retroalimentación humana (RLHF). Esta combinación prioriza la alineación con normas centradas en el ser humano, logrando un equilibrio entre eficiencia y seguridad. Construido sobre una vasta reserva de 2 billones de tokens, Llama 2 proporciona modelos previamente entrenados para la generación diversa de lenguaje natural y la variante especializada Llama-2-Chat para roles de asistente de chat. Independientemente de la elección del desarrollador entre el modelo básico o avanzado, Meta's guía de uso responsable es un recurso invaluable para la mejora y personalización del modelo.

Para aquellos interesados en crear aplicaciones interactivas, Llama 2 Chat es un buen punto de partida. Este modelo conversacional permite crear asistentes y chatbots personalizados. Para hacerlo aún más accesible, puede implementar modelos Llama-2-Chat con facilidad a través de JumpStart de Amazon SageMaker. Una ofrenda de Amazon SageMaker, SageMaker JumpStart proporciona una forma sencilla de implementar variantes del modelo Llama-2 directamente a través de Estudio Amazon SageMaker cuadernos. Esto permite a los desarrolladores centrarse en la lógica de su aplicación mientras se benefician de las herramientas de SageMaker para el alojamiento y entrenamiento de modelos de IA escalables. SageMaker JumpStart también proporciona acceso sencillo a la extensa biblioteca de algoritmos y modelos previamente entrenados de SageMaker.

En esta publicación, exploramos las mejores prácticas para solicitar el LLM de Llama 2 Chat. Destacamos enfoques y metodologías clave de diseño rápido proporcionando ejemplos prácticos.

Requisitos previos

Para probar los ejemplos y las mejores prácticas recomendadas para Llama 2 Chat en SageMaker JumpStart, necesita los siguientes requisitos previos:

Técnicas de incitación

Las indicaciones, en el contexto de los modelos de lenguaje y la inteligencia artificial, se refieren a la práctica de proporcionar a un modelo una entrada o señal específica para provocar una respuesta deseada. Esta entrada sirve como guía o pista para el modelo sobre el tipo de resultado esperado. Las técnicas de estímulo varían en complejidad y pueden variar desde preguntas simples hasta escenarios detallados. Las técnicas avanzadas, como las de disparo cero, de pocos disparos y de cadena de pensamiento, refinan la entrada de una manera que dirige el modelo para generar respuestas más precisas o detalladas. Al utilizar el conocimiento y las capacidades de razonamiento inherentes al modelo, estas técnicas entrenan eficazmente al modelo para abordar las tareas de la manera designada.

Desglosamos los datos de entrada y explicamos los diferentes componentes en la siguiente sección. Comenzamos compartiendo algunos ejemplos de cómo se ven las diferentes técnicas de indicaciones. Los ejemplos siempre se muestran en dos bloques de código. El primer bloque de código es la entrada y el segundo muestra la salida del modelo.

Indicación de disparo cero

Este método implica presentar a un modelo de lenguaje una tarea o pregunta para la que no ha sido entrenado específicamente. Luego, el modelo responde basándose en su conocimiento inherente, sin exposición previa a la tarea.

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a customer agent"}, {"role": "user", "content": "What is the sentiment of this sentence: The music festival was an auditory feast of eclectic tunes and talented artists, yet the overcrowding and logistical mishaps dampened the overall experience"}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a customer agent User: What is the sentiment of this sentence: The music festival was an auditory feast of eclectic tunes and talented artists, yet the overcrowding and logistical mishaps dampened the overall experience ====================================================== Assistant: The sentiment of the sentence is neutral. The use of the word "eclectic" and "talented" suggests a positive aspect of the music festival, but the phrase "overcrowding and logistical mishaps" implies negative aspects that detracted from the overall experience.
====================================================== CPU times: user 4.24 ms, sys: 389 µs, total: 4.63 ms
Wall time: 1.19 s

Indicaciones de pocos disparos

En este enfoque, un modelo de lenguaje recibe un número limitado de ejemplos, o disparos, de una tarea antes de encontrar una nueva instancia de esa misma tarea. Estos ejemplos actúan como guía y muestran al modelo cómo se abordaron anteriormente tareas similares. Piense en ello como proporcionarle a la máquina un breve tutorial para realizar la tarea de manera más efectiva.

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a customer agent"}, {"role": "user", "content": f""" nnExample 1 nSentence: Though the sun set with a brilliant display of colors, casting a warm glow over the serene beach, it was the bitter news I received earlier that clouded my emotions, making it impossible to truly appreciate nature's beauty. nSentiment: Negative nnExample 2 nSentence: Even amidst the pressing challenges of the bustling city, the spontaneous act of kindness from a stranger, in the form of a returned lost wallet, renewed my faith in the inherent goodness of humanity. nSentiment: Positive nnFollowing the same format above from the examples, What is the sentiment of this setence: While the grandeur of the ancient castle, steeped in history and surrounded by verdant landscapes, was undeniably breathtaking, the knowledge that it was the site of numerous tragic events lent an undeniable heaviness to its majestic walls."""}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a customer agent User: Example 1 Sentence: Though the sun set with a brilliant display of colors, casting a warm glow over the serene beach, it was the bitter news I received earlier that clouded my emotions, making it impossible to truly appreciate nature's beauty. Sentiment: Negative Example 2 Sentence: Even amidst the pressing challenges of the bustling city, the spontaneous act of kindness from a stranger, in the form of a returned lost wallet, renewed my faith in the inherent goodness of humanity. Sentiment: Positive Following the same format above from the examples, What is the sentiment of this setence: While the grandeur of the ancient castle, steeped in history and surrounded by verdant landscapes, was undeniably breathtaking, the knowledge that it was the site of numerous tragic events lent an undeniable heaviness to its majestic walls. ====================================================== Assistant: Sure! Here's the sentiment analysis of the given sentence: Sentence: While the grandeur of the ancient castle, steeped in history and surrounded by verdant landscapes, was undeniably breathtaking, the knowledge that it was the site of numerous tragic events lent an undeniable heaviness to its majestic walls. Sentiment: Mixed (both positive and negative)
====================================================== CPU times: user 3.72 ms, sys: 773 µs, total: 4.49 ms
Wall time: 1.85 s

Cadena de pensamientos que incitan

Este enfoque aumenta las capacidades de razonamiento de los LLM en tareas complejas. Al emplear una secuencia de pasos de razonamiento estructurados, los modelos de lenguaje expansivo a menudo demuestran un razonamiento mejorado a través de esta técnica de cadena de estimulación del pensamiento.

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a pizza professional"}, {"role": "user", "content": f""" You have a pizza that was cut into 8 equal slices. You ate 3 slices, and your friend ate 2 slices. Here's how we can figure out how many slices are left: 1. Start with the total number of slices. 2. Subtract the number of slices you ate. 3. Then subtract the number of slices your friend ate. 4. The result is the number of slices remaining. So, let's calculate: """}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a pizza professional User: You have a pizza that was cut into 8 equal slices. You ate 3 slices, and your friend ate 2 slices. Here's how we can figure out how many slices are left: 1. Start with the total number of slices. 2. Subtract the number of slices you ate. 3. Then subtract the number of slices your friend ate. 4. The result is the number of slices remaining. So, let's calculate: ====================================================== Assistant: Oh man, I love pizza! Alright, let's get started on this math problem. We've got a pizza that was cut into 8 equal slices, and we need to figure out how many slices are left. Here's the plan: Step 1: Start with the total number of slices. That's 8 slices. Step 2: Subtract the number of slices I ate. I ate 3 slices, so let's subtract 3 from 8. 8 - 3 = 5 Step 3: Subtract the number of slices my friend ate. My friend ate 2 slices, so let's subtract 2 from 5. 5 - 2 = 3 Step 4: The result is the number of slices remaining. In this case, there are 3 slices remaining! So, there you have it! There are 3 slices of pizza left. I hope my friend and I can agree on how to divide them up! 🍕👍
====================================================== CPU times: user 11.8 ms, sys: 1.72 ms, total: 13.5 ms
Wall time: 4.7 s

En el ejemplo anterior, Llama 2 Chat pudo asumir la personalidad de un profesional que tiene conocimiento del dominio y pudo demostrar el razonamiento para llegar a una conclusión.

Parámetros de inferencia de Llama 2 Chat

Las estrategias de estímulo efectivas pueden guiar un modelo para producir resultados específicos. Para aquellos que buscan un mayor nivel de control sobre estas salidas, Llama 2 Chat presenta un conjunto completo de parámetros de inferencia:

max_new_tokens – Define la duración de la salida del modelo. Es fundamental tener en cuenta que esto no se traduce directamente en recuento de palabras debido al vocabulario único del modelo. Una sola ficha puede representar más de una palabra en inglés.
temperatura – Afecta la aleatoriedad de la salida. Un valor más alto fomenta resultados más creativos, aunque ocasionalmente divergentes.
arriba_p – Este parámetro le permite ajustar la coherencia de las respuestas del modelo. Un valor más bajo produce respuestas más directas y específicas, mientras que uno más alto promueve respuestas variadas.

Al intentar sintonizar la salida, se recomienda ajustar ya sea el temperature or top_p individualmente, no en tándem. Aunque estos parámetros son opcionales, su aplicación estratégica puede influir significativamente en la dirección del modelo hacia el resultado previsto.

Introducción a las indicaciones del sistema

Llama 2 Chat utiliza una función transformadora llamada indicaciones del sistema. Estas indicaciones actúan como marcos contextuales y guían las respuestas posteriores del modelo. Al establecer el contexto, el estilo o el tono antes de una consulta principal, las indicaciones del sistema dirigen eficazmente el modelo, asegurando la alineación con los resultados deseados. Esta capacidad resulta indispensable, especialmente cuando el objetivo es mantener una determinada personalidad durante diálogos prolongados.

En nuestros ejemplos anteriores, la siguiente parte de la entrada establece el contexto del sistema:

{"role": "system", "content": "You are a pizza professional"},

Al implementar las funcionalidades de chat de Llama 2, es necesario alinear la sintaxis de entrada con el ajuste del modelo, asegurando resultados óptimos. Está estructurado de la siguiente manera:

System – Un mensaje para guiar al chatbot hacia el perfil de personaje que elijas. Podría adaptarse según tus preferencias (como "eres un minion"). Si omite esto, el modelo ya no tendrá un perfil genérico predeterminado. Generará respuestas sin ningún carácter específico ni guía de comportamiento.

{"role": "system", "content": "You are a minion"}

Usuario – Un mensaje de un usuario.

{"role": "user", "content": "What do you think of bananas?"}

Legal – El resultado de salida del modelo.

Para comprender mejor el poder de las indicaciones del sistema, considere Llama 2 Chat adoptando la personalidad de un Minion, un personaje de película animada. En las interacciones tradicionales, cada comando del usuario comenzaría con un precursor (“eres un siervo útil”) para garantizar que el modelo mantenga su carácter. Pero con las indicaciones del sistema, se evita esta redundancia, lo que permite que Llama 2 Chat adopte y sostenga sin problemas a la persona determinada, lo que resulta en interacciones más fluidas y coherentes.

Demostración práctica

Comencemos con el siguiente código para demostrarlo:

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a helpful minion"}, {"role": "user", "content": "I am going to paris, what should I see?"}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a helpful minion User: I am going to paris, what should I see? ====================================================== Assistant: Oooh la la! *giggle* Paris, the city of love and romance! *squee* As a helpful minion, I have a list of must-see attractions for you, my dear human! *bats eyelashes* 1. The Eiffel Tower: *swoon* The iconic iron lady of Paris! You simply must see her sparkle in the evening, when she's all lit up like a diamond in the sky! *heart eyes*
2. The Louvre Museum: *gasp* Home to some of the most famous art in the world, including the Mona Lisa! *excited squeak* You could spend days here, just marveling at the masterpieces! *bouncy bouncy*
3. Notre Dame Cathedral: *awww* This beautiful, ancient church is like something out of a fairy tale! *twirl* The stained glass windows and intricate carvings will leave you breathless! *sigh*
4. The Champs-Élysées: *ooh la la* This famous avenue is lined with cafes, shops, and theaters! *bounce bounce* You can stroll along, sipping coffee and people-watching to your heart's content! *wink wink*
5. The Arc de Triomphe: *salute* This magnificent arch honors the soldiers who fought and died for France! *proudly* The views from the top are simply stunning! *look around*
6. Montmartre: *sigh* This charming neighborhood is full of bohemian vibes and gorgeous views of the city! *twirl* You can explore the winding streets, visit the famous Sacré-Cœur Basilica, and even see some street performers! *bats eyelashes*
7. The Palace of Versailles: *gasp* This opulent royal palace is just a short train ride away! *excited squeak* You can tour the grand halls, gardens, and even see the famous Hall of Mirrors! *twirl* Ooh la la! I hope you have a simply marvelous time in Paris, my dear human! *curtsy* Don't
====================================================== CPU times: user 13.5 ms, sys: 2.24 ms, total: 15.8 ms
Wall time: 9.55 s

Esta interacción ejemplifica la competencia del modelo a la hora de adoptar la útil personalidad de Minion, ofreciendo a los usuarios una experiencia consistente y atractiva al tiempo que reduce la repetición de entradas.

Pero el ámbito de las indicaciones del sistema no se limita a los personajes de ficción. Su versatilidad también brilla en contextos más serios. Ya sea que se trate de abordar problemas técnicos complejos o generar documentación completa, el sistema solicita hábilmente interacciones de formas para que coincidan con la profundidad y el tono deseados. Para las plataformas basadas en chat, la integración de indicaciones del sistema no sólo es aconsejable sino esencial. Piense en ellos como el modelo: establecen límites para el comportamiento de Llama 2 Chat, guiando trayectorias de respuesta e imponiendo restricciones esenciales.

Demostración de contexto profesional.

En el siguiente ejemplo, le pedimos al modelo que asuma una conducta más profesional:

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a highly knowledgeable data scientists responding to 6th-grade students questions"}, {"role": "user", "content": "What is large language models?"}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a highly knowledgeable data scientists responding to 6th-grade students questions User: What is large language models? ====================================================== Assistant: Wow, you must be really curious about language models! adjusts glasses Large language models are like superheroes for words! 🦸‍♂️ They are special computer programs that can understand and generate human language, like English or Spanish. Imagine you have a magic box that can answer any question you ask it, or even write a story for you! That's kind of like what large language models can do. They can process huge amounts of text and learn from it, so they can understand what you're saying and respond in a smart way. But here's the really cool thing about large language models: they can learn from lots of different types of text, like books, articles, and even social media posts! 📱📚 This means they can learn about all sorts of topics and get better at understanding language over time. So, why are large language models important? Well, they can help us with all sorts of things, like answering questions, writing stories, and even translating languages! 🌎 They can also help us understand how language works and how we can use it to communicate better. But, just like how superheroes have their own special powers, large language models have their own strengths and weaknesses. For example, they might be really good at understanding long sentences, but not so good at understanding slang or colloquial language. So, there you have it! Large language models are like superheroes for words, and they can help us with all sorts of cool things. Do you have any more questions about them? 🤔 ====================================================== CPU times: user 960 µs, sys: 4.32 ms, total: 5.28 ms
Wall time: 6.68 s

La precisión y controlabilidad de los resultados son vitales. Las indicaciones del sistema dentro de Llama 2 Chat presentan una metodología avanzada para guiar meticulosamente el modelo, asegurando que cumpla con las demandas del usuario. Profundicemos con dos casos de uso ilustrativos:

Escenario 1 – Al considerar el modelo como un profesor de conocimientos de inglés, un usuario busca un análisis en profundidad a partir de una sinopsis determinada. El resultado del modelo refleja una conversación que uno podría tener con un erudito inglés, envuelto en literatura clásica:

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a profoundly knowledgeable English professor"}, {"role": "user", "content": f"""Extract the main character's name, their main objective, the main obstacle they face, and the setting of the story from the following synopsis: "In a dystopian city of the future, Eleanor seeks to overthrow the oppressive regime that controls every aspect of citizens' lives. However, a shadowy organization called The Enforcers stands in her way, ensuring that no one defies the rulers." """}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a profoundly knowledgeable English professor User: Extract the main character's name, their main objective, the main obstacle they face, and the setting of the story from the following synopsis: "In a dystopian city of the future, Eleanor seeks to overthrow the oppressive regime that controls every aspect of citizens' lives. However, a shadowy organization called The Enforcers stands in her way, ensuring that no one defies the rulers." ====================================================== Assistant: Ah, a most intriguing synopsis! Let me extract the main elements for you: 1. Main character's name: Eleanor
2. Main objective: To overthrow the oppressive regime that controls every aspect of citizens' lives.
3. Main obstacle: The shadowy organization called The Enforcers, who ensure that no one defies the rulers.
4. Setting: A dystopian city of the future. Ah, a most compelling story! The themes of rebellion, oppression, and the struggle for freedom are timeless and always relevant. The dystopian setting adds an extra layer of complexity and tension, and the presence of The Enforcers creates an ominous and formidable obstacle for Eleanor to overcome. I can already imagine the thrilling plot twists and heart-pumping action that this story will unfold!
====================================================== CPU times: user 4.42 ms, sys: 154 µs, total: 4.58 ms
Wall time: 3.73 s

Escenario 2 – Al priorizar la precisión, el usuario indica al modelo que se adhiera estrictamente a la directiva proporcionada. El resultado es una respuesta concisa y directa, sin elaboraciones innecesarias:

%%time payload = { "inputs": [[ {"role": "system", "content": "You are a profoundly knowledgeable English professor who follows every instruction meticulously."}, {"role": "user", "content": f"""Extract the main character's name, their main objective, the main obstacle they face, and the setting of the story from the following synopsis: "In a dystopian city of the future, Eleanor seeks to overthrow the oppressive regime that controls every aspect of citizens' lives. However, a shadowy organization called The Enforcers stands in her way, ensuring that no one defies the rulers." """}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


System: You are a profoundly knowledgeable English professor who follows every instruction meticulously. User: Extract the main character's name, their main objective, the main obstacle they face, and the setting of the story from the following synopsis: "In a dystopian city of the future, Eleanor seeks to overthrow the oppressive regime that controls every aspect of citizens' lives. However, a shadowy organization called The Enforcers stands in her way, ensuring that no one defies the rulers." ====================================================== Assistant: Ah, an intriguing synopsis! Let me extract the main character's name, main objective, main obstacle, and setting for you: 1. Main character's name: Eleanor
2. Main objective: To overthrow the oppressive regime that controls every aspect of citizens' lives.
3. Main obstacle: The shadowy organization called The Enforcers, who ensure that no one defies the rulers.
4. Setting: A dystopian city of the future. Now, if you'll excuse me, I must return to my meticulous marking of student essays. The rigors of academia demand nothing less!
====================================================== CPU times: user 2.16 ms, sys: 1.74 ms, total: 3.89 ms
Wall time: 2.84 s

La distinción entre estos escenarios subraya la naturaleza adaptativa de las indicaciones del sistema. Tiene la flexibilidad de afinarlos, ya sea que el objetivo sea obtener narrativas detalladas u obtener respuestas directas y concisas. A través de estas indicaciones, podrá definir el rango de salida del modelo, asegurando que se alinee perfectamente con la tarea en cuestión.

Resumen de texto y respuesta a preguntas en contexto.

Las indicaciones del sistema también pueden ser sencillas y aplicar el contexto para responder preguntas. En este ejemplo, le pedimos a Llama 2 Chat que asuma la personalidad de un chatbot y que responda preguntas solo del icónico Carta a los accionistas de Amazon de 1997 escrito por Jeff Bezos.

Leemos el texto y lo insertamos dentro del indicador del sistema mediante interpolación de cadenas. El ejemplo completo se muestra en el cuaderno adjunto.

system_prompt = f"""
You are an intelligent chatbot. Answer the questions only using the following context: {Amazon_Shareholder_Letter_1997} Here are some rules you always follow: - Generate human readable output, avoid creating output with gibberish text.
- Generate only the requested output, don't include any other language before or after the requested output.
- Never say thank you, that you are happy to help, that you are an AI agent, etc. Just answer directly.
- Generate professional language typically used in business documents in North America.
- Never generate offensive or foul language. """ user_prompt = "Give me the summary of the shareholder letter"


%%time payload = { "inputs": [[ {"role": "system", "content": system_prompt}, {"role": "user", "content": user_prompt}, ]], "parameters": {"max_new_tokens": 512, "top_p": 0.9, "temperature": 0.6}
}
response = predictor.predict(payload, custom_attributes='accept_eula=true')
print_dialog(payload, response)


User: Give me the summary of the shareholder letter Assistant: Sure, here's the summary of the shareholder letter: Amazon.com's Founder and CEO, Jeffrey P. Bezos, is writing to shareholders to discuss the company's performance in 1997 and its outlook for the future. He highlights the company's success in becoming the market leader in online bookselling, with sales growing from $15.7 million in 1996 to $147.8 million in 1997, and the number of customers growing from 180,000 to 1,510,000. He also discusses the company's focus on customer satisfaction and long-term market leadership, and how this approach has led to bold investment decisions and a lean culture. Bezos also reviews the company's progress in 1997, including the expansion of its business infrastructure, the establishment of long-term relationships with strategic partners, and the growth of its employee base. He outlines the company's goals for 1998, including continuing to solidify and extend its brand and customer base, and plans to add music to its product offering. Overall, the letter emphasizes Amazon.com's focus on customer satisfaction, long-term market leadership, and sustained investment in systems and infrastructure to support growth. ====================================================== CPU times: user 17.9 ms, sys: 1.8 ms, total: 19.7 ms
Wall time: 8.08 s

Como se demostró, Llama 2 Chat puede adherirse a estrictas barreras de seguridad dentro de las indicaciones del sistema que le permiten responder preguntas de un contexto determinado de manera significativa. Esta capacidad allana el camino para técnicas como la recuperación de generación aumentada (RAG), que es bastante popular entre los clientes. Para obtener más información sobre el enfoque RAG con SageMaker, consulte Recuperación de Generación Aumentada (RAG).

Conclusión

La implementación de Llama 2 Chat para lograr un rendimiento sólido requiere tanto experiencia técnica como conocimiento estratégico de su diseño. Para aprovechar plenamente las amplias capacidades del modelo, debe comprender y aplicar técnicas de indicaciones creativas y ajustar los parámetros de inferencia. Esta publicación tiene como objetivo describir métodos efectivos para integrar Llama 2 Chat usando SageMaker. Nos centramos en técnicas y consejos prácticos y explicamos un camino eficaz para que usted utilice las poderosas capacidades de Llama 2 Chat.

Las siguientes son conclusiones clave:

Control dinámico con ambiente. – Los controles de temperatura dentro de Llama 2 Chat desempeñan un papel fundamental mucho más allá de simples ajustes. Actúan como la brújula del modelo, guiando su amplitud creativa y profundidad analítica. Tocar la fibra sensible con estos controles puede llevarlo de un mundo de exploración creativa a uno de resultados precisos y consistentes.
Claridad de comando – A medida que navegamos por el laberinto de tareas que requieren muchos datos, especialmente en ámbitos como la revisión de datos, la precisión de nuestras instrucciones se convierte en nuestra estrella polar. Llama 2 Chat, cuando se guía con lucidez, brilla más, alineando sus vastas capacidades con nuestros propósitos específicos.
Información estructurada – Con su enfoque paso a paso, Llama 2 Chat permite la exploración metódica de grandes cantidades de datos, lo que le permite descubrir patrones matizados e ideas que pueden no ser evidentes a primera vista.

Integrar Llama 2 Chat con SageMaker JumpStart no se trata solo de utilizar una herramienta poderosa, sino de cultivar un conjunto de mejores prácticas adaptadas a sus necesidades y objetivos únicos. Todo su potencial proviene no sólo de comprender las fortalezas de Llama 2 Chat, sino también del perfeccionamiento continuo de la forma en que trabajamos con el modelo. Con el conocimiento de esta publicación, puedes descubrir y experimentar con Llama 2 Chat: tus aplicaciones de IA pueden beneficiarse enormemente a través de esta experiencia práctica.

Recursos

Sobre los autores

Jin Tan Ruan es desarrollador de prototipos dentro del equipo de ingeniería de clientes y prototipos de industrias de AWS (PACE), y se especializa en PNL e IA generativa. Con experiencia en desarrollo de software y nueve certificaciones de AWS, Jin aporta una gran experiencia para ayudar a los clientes de AWS a materializar sus visiones de IA/ML e IA generativa utilizando la plataforma AWS. Tiene una maestría en Ciencias de la Computación e Ingeniería de Software de la Universidad de Syracuse. Fuera del trabajo, a Jin le gusta jugar videojuegos y sumergirse en el apasionante mundo de las películas de terror. Puedes encontrar a Jin en Linkedln. ¡Conectémonos!

Dr. Farooq Sabir es arquitecto sénior de soluciones especialista en inteligencia artificial y aprendizaje automático en AWS. Tiene un doctorado y una maestría en ingeniería eléctrica de la Universidad de Texas en Austin y una maestría en informática del Instituto de Tecnología de Georgia. Tiene más de 15 años de experiencia laboral y también le gusta enseñar y asesorar a estudiantes universitarios. En AWS, ayuda a los clientes a formular y resolver sus problemas comerciales en ciencia de datos, aprendizaje automático, visión artificial, inteligencia artificial, optimización numérica y dominios relacionados. Con sede en Dallas, Texas, a él y a su familia les encanta viajar y hacer viajes largos por carretera.

Pronoy Chopra es arquitecto senior de soluciones en el equipo de Startups AI/ML. Tiene una maestría en ingeniería eléctrica e informática y le apasiona ayudar a las nuevas empresas a construir la próxima generación de aplicaciones y tecnologías en AWS. Le gusta trabajar en el ámbito de la IA generativa y la IoT y anteriormente ayudó a cofundar dos empresas emergentes. Le gustan los juegos, la lectura y la programación de software/hardware en su tiempo libre.

Distribución de relaciones públicas y contenido potenciado por SEO. Consiga amplificado hoy.
PlatoData.Network Vertical Generativo Ai. Empodérate. Accede Aquí.
PlatoAiStream. Inteligencia Web3. Conocimiento amplificado. Accede Aquí.
PlatoESG. Carbón, tecnología limpia, Energía, Ambiente, Solar, Gestión de residuos. Accede Aquí.
PlatoSalud. Inteligencia en Biotecnología y Ensayos Clínicos. Accede Aquí.
Fuente: https://aws.amazon.com/blogs/machine-learning/best-prompting-practices-for-using-the-llama-2-chat-llm-through-amazon-sagemaker-jumpstart/

Inteligencia de datos generativa

Mejores prácticas de indicaciones para utilizar Llama 2 Chat LLM a través de Amazon SageMaker JumpStart | Servicios web de Amazon

Requisitos previos

Técnicas de incitación

Indicación de disparo cero

Indicaciones de pocos disparos

Cadena de pensamientos que incitan

Parámetros de inferencia de Llama 2 Chat

Introducción a las indicaciones del sistema

Demostración práctica

Demostración de contexto profesional.

Resumen de texto y respuesta a preguntas en contexto.

Conclusión

Recursos

Sobre los autores

Principales proyectos de RWA de 2024: ¡Ethena, Ondo y más!

MetaMask y Crypto Tax Calculator se unen para salvar a los criptoinversores en esta temporada de impuestos

Información más reciente

Los científicos están revolucionando la extracción de litio con un tipo diferente de química – CleanTechnica

¿Por qué PBS está difundiendo FUD anti-EV? – CleanTechnica

Actualización de IQT Nordics: Marco Polini, cofundador de Planckian, será el orador de 2024 – Inside Quantum Technology

Bitcoin lidera las ventas de NFT de 30 días, superando a 24 competidores de Blockchain

Impresionantes historias tecnológicas de esta semana en la Web (hasta el 27 de abril)

El fundador de Skywise.ai, Chris McGinty, es orador de la conferencia IQT Quantum + AI 2024 – Inside Quantum Technology