What happened this week in AI by Louie This week we have been watching development in coding models at META as well as new fine-tuning capabilities at OpenAI. Meta has introduced Code LLaMA, a large language model with the capability to both generate code based on prompts and describe codes. They unveiled three iterations featuring varying parameter counts (7B, 13B, and 34B), each of which has undergone training on an additional 500 billion code-related tokens. The models have compatibility with widely recognized programming languages such as Python, C++, Java, PHP, and others. Furthermore, two specialized models are built on top of it. Code LLaMA - Instruct is the tuned iteration of Code LLaMA, designed to follow instructions. Additionally, there is Code LLaMA - Python, a dedicated model tailored for the Python programming language. The models have been made available under licenses for both research and commercial use. The open source release of this model enables rapid iteration and already we have seen further models built on top of it including
This AI newsletter is all you need #62
This AI newsletter is all you need #62
This AI newsletter is all you need #62
What happened this week in AI by Louie This week we have been watching development in coding models at META as well as new fine-tuning capabilities at OpenAI. Meta has introduced Code LLaMA, a large language model with the capability to both generate code based on prompts and describe codes. They unveiled three iterations featuring varying parameter counts (7B, 13B, and 34B), each of which has undergone training on an additional 500 billion code-related tokens. The models have compatibility with widely recognized programming languages such as Python, C++, Java, PHP, and others. Furthermore, two specialized models are built on top of it. Code LLaMA - Instruct is the tuned iteration of Code LLaMA, designed to follow instructions. Additionally, there is Code LLaMA - Python, a dedicated model tailored for the Python programming language. The models have been made available under licenses for both research and commercial use. The open source release of this model enables rapid iteration and already we have seen further models built on top of it including