Google I/O Connect
Gema Parreño Senior Data Scientist

Community and Artificial Intelligence at Google I/O Connect

In the vibrant city of Amsterdam, an event has brought together more than 500 developers from all over the world for an unrivalled experience: I/O Connect. This event, which has captured the attention of the technology industry, focuses on one of the most exciting and cutting-edge topics of our time: artificial intelligence. I/O Connect has been the perfect stage for technology minds to exchange knowledge, make valuable connections and discover the latest trends that will drive the future of innovation. 

We have had the opportunity to explore how this technology is transforming business and society at large. From machine learning algorithms to natural language processing, we've discovered Google's newest and most exciting AI tools. In addition, we were given access to live demonstrations and interacted with the latest technological creations. 

Why LLMs? 

We focus today on LLMs - an acronym for Large Language Models - for two reasons: the many challenges they impact, and the importance of taking the first steps towards technical control, moving from demos to robust products.

LLMs are Deep Learning models trained on datasets with huge amounts of text: they can cover a wide range of challenges, such as summarisation, translation, text and code generation, and even complementing search and recommendation engines.

Bringing it back to reality, one of the workshops covered PaLM and explored the parameters that control its API: a first step in moving from a demo to a minimum viable product by controlling those parameters through the Python SDK that calls the Vertex AI PaLM API. In addition, we explore the Responsible AI principles that this API puts at your disposal to mitigate possible hallucinations and offensive generations.
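As a reference, here is a minimal setup sketch for calling the API from Python. The project ID and region below are placeholders to replace with your own Google Cloud values, and the exact import paths may vary with the SDK version:

```python
# pip install google-cloud-aiplatform
import vertexai

# Placeholder project ID and region: replace with your own GCP values.
vertexai.init(project="my-gcp-project", location="us-central1")
```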

The objective is to understand the possibilities the API offers, so as to adapt it to the use cases we are dealing with.


Vertex AI PaLM API models: 

The API offers six different language models, with the first two blocks separated by a functional criterion: whether they are language-oriented or code-oriented. The language-oriented ones are:

  • Text-bison: the model associated with most natural language processing tasks. It supports over 8,000 input tokens, which lets it take in a decent amount of context, and its training data is up to date as of February 2023 - by comparison, ChatGPT's limitations state that its most used version has a cutoff in 2021. This model supports fine-tuning, or retraining.
  • Chat-bison: a model retrained on top of Text-bison that supports a smaller initial context, with generation similar to its analogue.
  • Textembedding-gecko: embeddings are transformations from text into numerical representations, the form in which language models work with text. A minimal sketch of calling it follows this list.
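As a sketch of the embeddings model (model name and import path as published at the time; check the current SDK documentation, as they may have changed):

```python
from vertexai.language_models import TextEmbeddingModel

# Load the embeddings model and turn a sentence into a numerical vector.
embedding_model = TextEmbeddingModel.from_pretrained("textembedding-gecko@001")
embeddings = embedding_model.get_embeddings(["Hello PaLM 2"])
vector = embeddings[0].values  # a list of floats (768 dimensions for gecko)
print(len(vector))
```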

For a first approximation of PaLM, we load the model, pass a prompt, and generate a response. 


Figure 1. "Hello PaLM 2": the three essential parts of using the API: we load a model, pass a prompt, and generate a response. It is worth noting that the method for loading the model has the same semantics as the Hugging Face Transformers library.
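A minimal sketch of those three parts, along the lines of Figure 1 (assuming vertexai.init has already been called; in older SDK versions the import lives under vertexai.preview.language_models):

```python
from vertexai.language_models import TextGenerationModel

# Load the model: from_pretrained mirrors Hugging Face Transformers semantics.
model = TextGenerationModel.from_pretrained("text-bison@001")

# Pass a prompt and generate a response.
response = model.predict("Hello PaLM 2! What can you do?")
print(response.text)
```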

API control parameters 

The initial control parameters are passed as arguments when requesting the response. The text-bison model offers four parameters, to be explored depending on the use case, as in the sketch after this list.

  • Temperature (range 0-1, default 0): used for sampling during response generation, it controls the degree of stochasticity in token selection. Values close to 0 work well for prompts that require more deterministic, less open-ended responses, while values closer to 1 can lead to more "creative" or diverse results. A temperature of 0 is deterministic: the answer with the highest probability is always selected. For most use cases, it is recommended to start with a value of 0.2. Responsible AI tip: while the results may be more creative, they may also include meaningless or inappropriate text.
  • Max_output_tokens (range 1-1024, default 128): the maximum number of tokens that can be generated in the response. Specify a lower value for shorter responses and a higher value for longer ones. A token can be smaller than a word; a token is approximately four characters, and 100 tokens correspond to roughly 60-80 words. It is essential to keep token sizes in mind, as the models have a limit on the number of input and output tokens.
  • Top_p (range 0 to 1, default 0.95): controls the diversity of the generated text and, at a low level, changes how the model selects tokens for output. A higher top_p produces more "diverse" and "interesting" results, as the model is allowed to choose from a wider set of possibilities; a lower top_p yields more predictable output, as the model is limited to a smaller set of possible tokens. Specify a lower value to reduce randomness.
  • Top_k (range 0 to 40, default 40): also changes how the model selects tokens for output. A top_k of 1 means the selected token is the most likely one in the model's vocabulary (also known as greedy decoding), while a top_k of 3 means the next token is selected from the 3 most likely tokens (using temperature). At each selection step, the top_k tokens with the highest probabilities are sampled, then further filtered by top_p, and the final token is chosen via temperature sampling.
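A sketch of a call that sets all four parameters, using the question from Figure 2. The values are illustrative starting points, not recommendations for every use case:

```python
from vertexai.language_models import TextGenerationModel

model = TextGenerationModel.from_pretrained("text-bison@001")

response = model.predict(
    "Was it Frodo who destroyed the ring?",
    temperature=0.2,        # low stochasticity: mostly deterministic answers
    max_output_tokens=256,  # cap the length of the generated response
    top_p=0.8,              # sample from a narrower nucleus of probable tokens
    top_k=40,               # consider at most the 40 most likely tokens per step
)
print(response.text)
```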


Figure 2. A request configured with these parameters, with the aim of answering the question: was it Frodo who destroyed the ring? Documentation on the parameters can be found here.

Sample Notebook here.

Conclusions: so far, we have loaded a model and learned more about the parameters that help us control it. To find out more about these parameters, you can consult the documentation here. We now move on to the design of the inputs, or prompts, with a series of best practices recommended by the Google Cloud team for getting the most out of the API.

Use case: Response generation with Vertex AI within the Question-Answer scenario 

As mentioned above, there are multiple use cases that the pre-trained model can solve. Based on the different examples in the repository, we focus on functional question-answer problems. These models can solve problems associated with customer service, website chats, forums, etc. 

However, in addition to the model itself, giving it an optimal prompt can significantly influence the results. For this reason, several good practices are presented. As the first two key concepts, the prompt must be specific, concise, and rich in context, as well as free of grammatical errors, and it should ask only one question per prompt. We then classify the type of question depending on the domain:

  • Open domain: all questions whose answers are available online. They can belong to any category, such as history, geography, countries, politics, chemistry, etc. These include trivia or general knowledge questions, such as: Q. Who won the Olympic gold medal in swimming? Q. Who is the president of [particular country]? Q. Who wrote [specific book]? Be aware of the training cutoff of generative models: questions involving information more recent than the date the model was trained on may receive incorrect or imaginative answers.
  • Closed domain: specific questions whose answers live in an internal knowledge base not available on the public Internet. If correctly stated, the model is more likely to respond from the context provided and less likely to give answers beyond what is on the open Internet. If, for example, you want to build a question-and-answer bot based on the full documentation of a product, you can pass the full documentation to the model and ask it to answer only on that basis.

In both open and closed domains, we can include one or several questions in the prompt, depending on the specificity of the domain. In the case of closed domains, we also add a string as context, as in the sketch below.
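For the closed-domain case, a sketch that passes documentation as a context string and asks the model to answer only from it. The documentation snippet and product name are invented for illustration:

```python
from vertexai.language_models import TextGenerationModel

model = TextGenerationModel.from_pretrained("text-bison@001")

# Invented documentation snippet standing in for a real internal knowledge base.
context = """ProductX supports exporting reports in CSV and PDF formats.
Exports are limited to 10,000 rows per file."""

prompt = f"""Answer the question using only the context below.
If the answer is not in the context, say you do not know.

Context: {context}

Question: What export formats does ProductX support?
Answer:"""

response = model.predict(prompt, temperature=0.0, max_output_tokens=128)
print(response.text)
```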


Figure 3. Within the open-domain context, we can pass a series of questions to give the model context; in this case, geography and history questions.
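In the spirit of Figure 3, a sketch of an open-domain prompt that supplies a few question-answer pairs as context before the real question (the pairs are illustrative):

```python
from vertexai.language_models import TextGenerationModel

model = TextGenerationModel.from_pretrained("text-bison@001")

# A few geography and history question-answer pairs to set the pattern.
prompt = """Q: What is the capital of France?
A: Paris

Q: In what year did the French Revolution begin?
A: 1789

Q: What is the longest river in Europe?
A:"""

response = model.predict(prompt, temperature=0.0, max_output_tokens=32)
print(response.text)
```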

There are many particularities that you can explore in the open source repository that the Google Cloud team makes available to you.  

In addition to learning, it has been a unique experience that has given us the opportunity to connect with the community. Congratulations to the organising team and to all attendees for their kindness and energy!