OpenAI Blog

We’re bringing the Financial Times’ world-class journalism to ChatGPT

OpenAI — Fri, 26 Apr 2024 20:36:55 GMT

Editor’s note: This news was originally shared by the Financial Times and can be read here.

The Financial Times today announced a strategic partnership and licensing agreement with OpenAI, a leader in artificial intelligence research and deployment, to enhance ChatGPT with attributed content, help improve its models’ usefulness by incorporating FT journalism, and collaborate on developing new AI products and features for FT readers.

Through the partnership, ChatGPT users will be able to see select attributed summaries, quotes and rich links to FT journalism in response to relevant queries.

In addition, the FT became a customer of ChatGPT Enterprise earlier this year, purchasing access for all FT employees to ensure its teams are well-versed in the technology and can benefit from the creativity and productivity gains made possible by OpenAI’s tools.

“This is an important agreement in a number of respects,” said FT Group CEO John Ridding. “It recognises the value of our award-winning journalism and will give us early insights into how content is surfaced through AI. We have long been a leader in news media innovation, pioneering the subscription model and engagement technologies, and this partnership will help to keep us at the forefront of developments in how people access and use information.”

“The FT is committed to human journalism, as produced by our unrivalled newsroom, and this agreement will broaden the reach of that work, while deepening our understanding of reader demands and interests,” Ridding added. “Apart from the benefits to the FT, there are broader implications for the industry. It’s right, of course, that AI platforms pay publishers for the use of their material. OpenAI understands the importance of transparency, attribution, and compensation – all essential for us. At the same time, it’s clearly in the interests of users that these products contain reliable sources.”

Brad Lightcap, COO of OpenAI, expressed enthusiasm about the evolving relationship with the Financial Times, stating: “Our partnership and ongoing dialogue with the FT is about finding creative and productive ways for AI to empower news organisations and journalists, and enrich the ChatGPT experience with real-time, world-class journalism for millions of people around the world.”

"We're keen to explore the practical outcomes regarding news sources and AI through this partnership,” said Ridding. “We value the opportunity to be inside the development loop as people discover content in new ways. As with any transformative technology, there is potential for significant advancements and major challenges, but what’s never possible is turning back time. It’s important for us to represent quality journalism as these products take shape – with the appropriate safeguards in place to protect the FT’s content and brand.

We have always embraced new technologies and disruption, and we’ll continue to operate with both curiosity and vigilance as we navigate this next wave of change.”

Introducing more enterprise-grade features for API customers

OpenAI — Mon, 22 Apr 2024 20:51:50 GMT

We work with many enterprises like Klarna, Morgan Stanley, Oscar, Salesforce, and Wix to help them build AI solutions from scratch and safely deploy AI across their organizations and products. We’re deepening our support for enterprises with new features that are useful for both large businesses and any developers who are scaling quickly on our platform.

Enhanced enterprise-grade security

We’ve introduced Private Link, a new way that customers can ensure direct communication between Azure and OpenAI while minimizing exposure to the open internet. We’ve also released native Multi-Factor Authentication (MFA) to help ensure compliance with increasing access control requirements. These are new additions to our existing stack of enterprise security features including SOC 2 Type II certification, single sign-on (SSO), data encryption at rest using AES-256 and in transit using TLS 1.2, and role-based access controls. We also offer Business Associate Agreements for healthcare companies that require HIPAA compliance and a zero data retention policy for API customers with a qualifying use case.

Better administrative control

With our new Projects feature, organizations will have more granular control and oversight over individual projects in OpenAI. This includes the ability to scope roles and API keys to specific projects, restrict/allow which models to make available, and set usage- and rate-based limits to give access and avoid unexpected overages. Project owners will also have the ability to create service account API keys, which give access to projects without being tied to an individual user.

Assistants API improvements

We’ve introduced several updates to the Assistants API for more accurate retrieval, flexibility around model behavior and tools used to complete tasks, and better control over costs. These features include:

Improved retrieval with ‘file_search’ which can ingest up to 10,000 files per assistant—a 500x increase from the previous file limit of 20. The tool is faster, supports parallel queries through multi-threaded searches, and has enhanced reranking and query rewriting.
Streaming support for real-time, conversational responses—one of the top requests from developers and enterprises.
New ‘vector_store’ objects in the API so files can be added to a vector store and automatically parsed, chunked, and embedded in preparation for file search. Vector stores can be used across assistants and threads, simplifying file management and billing.
Control over the maximum number of tokens used per run, plus limits on previous and recent messages used in each run, so you can manage token usage costs.
New ‘tool_choice’ parameter to select a specific tool (like ‘file_search’, ‘code_interpreter’, or ‘function’) in a particular run.
Support for fine-tuned GPT-3.5 Turbo models in the API (to start, we’ll support fine-tunes of ‘gpt-3.5-turbo-0125’).

More options for cost management

To help organizations scale their AI usage without over-extending their budgets, we’ve added two new ways to reduce costs on consistent and asynchronous workloads:

Discounted usage on committed throughput: Customers with a sustained level of tokens per minute (TPM) usage on GPT-4 or GPT-4 Turbo can request access to provisioned throughput to get discounts ranging from 10–50% based on the size of the commitment.
Reduced costs on asynchronous workloads: Customers can use our new Batch API to run non-urgent workloads asynchronously. Batch API requests are priced at 50% off shared prices, offer much higher rate limits, and return results within 24 hours. This is ideal for use cases like model evaluation, offline classification, summarization, and synthetic data generation.

We plan to keep adding new features focused on enterprise-grade security, administrative controls, and cost management. For more information on these launches, visit our API documentation or get in touch with our team to discuss custom solutions for your enterprise.

OpenAI’s commitment to child safety: adopting safety by design principles

OpenAI — Mon, 22 Apr 2024 19:34:22 GMT

OpenAI, alongside industry leaders including Amazon, Anthropic, Civitai, Google, Meta, Metaphysic, Microsoft, Mistral AI, and Stability AI, has committed to implementing robust child safety measures in the development, deployment, and maintenance of generative AI technologies as articulated in the Safety by Design principles. This initiative, led by Thorn, a nonprofit dedicated to defending children from sexual abuse, and All Tech Is Human, an organization dedicated to tackling tech and society's complex problems, aims to mitigate the risks generative AI poses to children. By adopting comprehensive Safety by Design principles, OpenAI and our peers are ensuring that child safety is prioritized at every stage in the development of AI. To date, we have made significant effort to minimize the potential for our models to generate content that harms children, set age restrictions for ChatGPT, and actively engage with the National Center for Missing and Exploited Children (NCMEC), Tech Coalition, and other government and industry stakeholders on child protection issues and enhancements to reporting mechanisms.

As part of this Safety by Design effort, we commit to:

Develop: Develop, build, and train generative AI models that proactively address child safety risks.
- Responsibly source our training datasets, detect and remove child sexual abuse material (CSAM) and child sexual exploitation material (CSEM) from training data, and report any confirmed CSAM to the relevant authorities.
- Incorporate feedback loops and iterative stress-testing strategies in our development process.
- Deploy solutions to address adversarial misuse.
Deploy: Release and distribute generative AI models after they have been trained and evaluated for child safety, providing protections throughout the process.
- Combat and respond to abusive content and conduct, and incorporate prevention efforts.
- Encourage developer ownership in safety by design.
Maintain: Maintain model and platform safety by continuing to actively understand and respond to child safety risks.
- Committed to removing new AIG-CSAM generated by bad actors from our platform.
- Invest in research and future technology solutions.
- Fight CSAM, AIG-CSAM and CSEM on our platforms.

This commitment marks an important step in preventing the misuse of AI technologies to create or spread child sexual abuse material (AIG-CSAM) and other forms of sexual harm against children. As part of the working group, we have also agreed to release progress updates every year.

We care deeply about the safety and responsible use of our tools, which is why we’ve built strong guardrails and safety measures into ChatGPT and DALL-E. We are committed to working alongside Thorn, All Tech is Human and the broader tech community to uphold the Safety by Design principles and continue our work in mitigating potential harms to children.

Chelsea Carlson, Child Safety TPM

This collective action underscores our shared approach to child safety, demonstrating a shared commitment to ethical innovation and the well-being of the most vulnerable members of society. Thorn has published the principles at https://teamthorn.co/gen-ai

Introducing OpenAI Japan

OpenAI — Fri, 12 Apr 2024 19:09:59 GMT

Editor’s note: Japanese follows English (日本語は英語の後に続きます)

As we grow our operations internationally, we’re expanding into Asia with a new office in Tokyo, Japan. We are committed to collaborating with the Japanese government, local businesses, and research institutions to develop safe AI tools that serve Japan’s unique needs and to unlock new opportunities. We chose Tokyo as our first Asian office for its global leadership in technology, culture of service, and a community that embraces innovation.

We’re excited to be in Japan which has a rich history of people and technology coming together to do more. We believe AI will accelerate work by empowering people to be more creative and productive, while also delivering broad value to current and new industries that have yet to be imagined.

Sam Altman, CEO of OpenAI

To spearhead our efforts in Japan and ensure we are deeply integrated within the local community, we welcome Tadao Nagasaki as our new President of OpenAI Japan. Mr. Nagasaki will lead our commercial and market engagement efforts and help build our local team that will advance Global Affairs, Go-to-Market, Communications, Operations and other functions in serving Japan.

As a first step in our long-term commitment to the region, we’re providing local businesses with early access to a GPT-4 custom model specifically optimized for the Japanese language. This custom model offers improved performance in translating and summarizing Japanese text, is cost effective, and operates up to 3x faster than its predecessor. Speak, a top English learning app in Japan, is seeing 2.8x faster tutor explanations in Japanese when users make a mistake with a 47% reduction in token cost, unlocking higher quality tutor feedback in more places and with higher limits per user. We plan to release the custom model more broadly in the API in the coming months.

We are releasing a GPT-4 custom model optimized for the Japanese language which offers improved performance in Japanese text and operates up to 3x faster than GPT-4 Turbo.

Our new local presence also gets us closer to leading businesses like Daikin, Rakuten, and TOYOTA Connected who are using ChatGPT Enterprise to automate complex business processes, assist in data analysis, and optimize internal reporting. ChatGPT also helps accelerate the efforts of local governments, such as Yokosuka City, which is leveraging the technology to improve the efficiency of public services in Japan. Over the past year, the city has gradually provided ChatGPT access to almost all city employees, and 80% have reported increases in productivity. Now Yokosuka City has formed a network with 21 local governments—including the Tokyo Metropolitan Government and the City of Kobe—to share best practices of ChatGPT use in government.

As a key global voice on AI policy, the Japanese government chaired the G7 Hiroshima AI Process and worked to implement AI policies that align with its goals for human dignity, diversity and inclusion, and sustainable societies, while helping Japan realize solutions to its rural depopulation and labor shortage. We look forward to contributing to the local ecosystem, while exploring how AI can help with these societal challenges in the region.

Growing our presence across the world allows us to learn from a wide range of diverse perspectives, which is critical to our mission of ensuring AGI benefits all of humanity. If you are interested in joining us, please see our Careers page for all open positions.

OpenAI Japan 始動

東京にアジア初のオフィスを開設するとともに、日本語に最適化されたGPT-4カスタムモデルの提供を開始します。

OpenAI がグローバルに事業を拡大する中、本日、東京に新しいオフィスを設立し、アジアへと展開していきます。アジアでの最初の拠点として技術、サービスの文化、イノベーションを受け入れるコミュニティにおいて、世界をリードする東京を選びました。日本の独自のニーズに応える安全なAIツールの開発を目指し、政府、地元企業、研究機関と協力していくことに尽力していきます。

「日本にオフィスを開設できたことを嬉しく思います。日本は長い歴史を通じ、人々と技術が協力し、大変多くのことを成し遂げています。AIが、人々をより創造的で生産的になるのを助け、まだ想像されていない新しい産業にも広範囲に価値を提供することを加速できると信じています。」- サム・アルトマン、OpenAI CEO

OpenAI の日本における活動をリードし、日本のコミュニティに深く溶け込んで貢献するため、長﨑忠雄がOpenAI Japanの社長に着任しました。長﨑は、セールスと事業開発をリードし、併せて渉外、製品およびサービスに関する計画、コミュニケーション、オペレーションなどを担うチームを構築していきます。

日本への長期的なコミットメントの第一歩として、私たちは日本の企業に日本語に特化して最適化されたGPT-4カスタムモデルの提供を開始しています。このカスタムモデルは、日本語のテキストの翻訳と要約のパフォーマンス、およびコスト効率を向上させ、前モデルと比較して、最大3倍高速に動作します。この技術を活用した日本で最も利用されている英語学習アプリ「Speak」は、ユーザーが間違えた際のチューター（指導者）の説明が2.8倍速くなりました。トークン数が減り、効率化されたことでそのコストが47％削減されています。より多くの場所でより高品質なフィードバックが可能になりました。

このカスタムモデルは、数か月以内にAPIで広くリリースされる予定です。

写真キャプション：GPT-4のカスタムモデルを日本語向けに最適化しました。日本語テキストの性能向上と、GPT-4 Turboより最大3倍高速な動作を提供します。

日本においてはすでに、ダイキン、楽天、トヨタコネクテッドなどの日本の主要企業に導入され、ChatGPTエンタープライズを利用して複雑なビジネスプロセスの自動化、データ分析の支援、社内報告の最適化を図っています。また、ChatGPTは、横須賀市などの地方自治体に活用され、地域の公共サービスの生産性向上に貢献しています。横須賀市によると、過去1年間、全市職員のほとんどにChatGPTのアクセスを段階的に提供し、80%が生産性の向上を報告されています。現在、横須賀市は東京都や神戸市を含む21の地方自治体とネットワークを形成し、行政におけるChatGPT使用に関するベストプラクティスを共有しています。

AI政策における世界の主要な声として、日本政府はG7広島AIプロセスを主導し、人間の尊厳、多様性と包摂、持続可能な社会という目標に合致するAI政策の実施に取り組んでいます。まずは地方の過疎化と労働力不足への解決策を実現していくことでしょう。OpenAI もこのエコシステムに貢献し、日本の社会的課題に対してAIがどのように役立つかを探求していくことを楽しみにしています。

私たちが成長し、日本を含む世界で存在感を高めることで、私たちは多様な視点から学ぶことができます。それは人類全体にAGIの利益を確実にするという私たちの使命にとって、きわめて重要です。

OpenAI Japanでは、一緒に働く仲間を募集しています。詳細は日本の採用ページで公開していきますのでぜひご覧ください。

Introducing improvements to the fine-tuning API and expanding our custom models program

OpenAI — Wed, 03 Apr 2024 17:44:32 GMT

There are a variety of techniques that developers can use to increase model performance in an effort to reduce latency, improve accuracy, and reduce costs. Whether it’s extending model knowledge with retrieval-augmented generation (RAG), customizing a model’s behavior with fine-tuning, or building a custom-trained model with new domain-specific knowledge, we have developed a range of options to support our customers’ AI implementations. Today, we’re launching new features to give developers more control over fine-tuning with the API and introducing more ways to work with our team of AI experts and researchers to build custom models.

New fine-tuning API features

We launched the self-serve fine-tuning API for GPT-3.5 in August 2023. Since then, thousands of organizations have trained hundreds of thousands of models using our API. Fine-tuning can help models deeply understand content and augment a model’s existing knowledge and capabilities for a specific task. Our fine-tuning API also supports a larger volume of examples than can fit in a single prompt to achieve higher quality results while reducing cost and latency. Some of the common use cases of fine-tuning include training a model to generate better code in a particular programming language, to summarize text in a specific format, or to craft personalized content based on user behavior.

For example, Indeed, a global job matching and hiring platform, wants to simplify the hiring process. As part of this, Indeed launched a feature that sends personalized recommendations to job seekers, highlighting relevant jobs based on their skills, experience, and preferences. They fine-tuned GPT-3.5 Turbo to generate higher quality and more accurate explanations. As a result, Indeed was able to improve cost and latency by reducing the number of tokens in prompt by 80%. This let them scale from less than one million messages to job seekers per month to roughly 20 million.

Today, we’re introducing new features to give developers even more control over their fine-tuning jobs, including:

Epoch-based Checkpoint Creation: Automatically produce one full fine-tuned model checkpoint during each training epoch, which reduces the need for subsequent retraining, especially in the cases of overfitting
Comparative Playground: A new side-by-side Playground UI for comparing model quality and performance, allowing human evaluation of the outputs of multiple models or fine-tune snapshots against a single prompt
Third-party Integration: Support for integrations with third-party platforms (starting with Weights and Biases this week) to let developers share detailed fine-tuning data to the rest of their stack
Comprehensive Validation Metrics: The ability to compute metrics like loss and accuracy over the entire validation dataset instead of a sampled batch, providing better insight on model quality
Hyperparameter Configuration: The ability to configure available hyperparameters from the Dashboard (rather than only through the API or SDK)
Fine-Tuning Dashboard Improvements: Including the ability to configure hyperparameters, view more detailed training metrics, and rerun jobs from previous configurations

Expanding our Custom Models Program

Assisted Fine-Tuning

At DevDay last November, we announced a Custom Model program designed to train and optimize models for a specific domain, in partnership with a dedicated group of OpenAI researchers. Since then, we've met with dozens of customers to assess their custom model needs and evolved our program to further maximize performance.

Today, we are formally announcing our assisted fine-tuning offering as part of the Custom Model program. Assisted fine-tuning is a collaborative effort with our technical teams to leverage techniques beyond the fine-tuning API, such as additional hyperparameters and various parameter efficient fine-tuning (PEFT) methods at a larger scale. It’s particularly helpful for organizations that need support setting up efficient training data pipelines, evaluation systems, and bespoke parameters and methods to maximize model performance for their use case or task.

For example, SK Telecom, a telecommunications operator serving over 30 million subscribers in South Korea, wanted to customize a model to be an expert in the telecommunications domain with an initial focus on customer service. They worked with OpenAI to fine-tune GPT-4 to improve its performance in telecom-related conversations in the Korean language. Over the course of multiple weeks, SKT and OpenAI drove meaningful performance improvement in telecom customer service tasks—a 35% increase in conversation summarization quality, a 33% increase in intent recognition accuracy, and an increase in satisfaction scores from 3.6 to 4.5 (out of 5) when comparing the fine-tuned model to GPT-4.

Custom-Trained Model

In some cases, organizations need to train a purpose-built model from scratch that understands their business, industry, or domain. Fully custom-trained models imbue new knowledge from a specific domain by modifying key steps of the model training process using novel mid-training and post-training techniques. Organizations that see success with a fully custom-trained model often have large quantities of proprietary data—millions of examples or billions of tokens—that they want to use to teach the model new knowledge or complex, unique behaviors for highly specific use cases.

For example, Harvey, an AI-native legal tool for attorneys, partnered with OpenAI to create a custom-trained large language model for case law. While foundation models were strong at reasoning, they lacked the extensive knowledge of legal case history and other knowledge required for legal work. After testing out prompt engineering, RAG, and fine-tuning, Harvey worked with our team to add the depth of context needed to the model—the equivalent of 10 billion tokens worth of data. Our team modified every step of the model training process, from domain-specific mid-training to customizing post-training processes and incorporating expert attorney feedback. The resulting model achieved an 83% increase in factual responses and attorneys preferred the customized model’s outputs 97% of the time over GPT-4.

What’s next for model customization

We believe that in the future, the vast majority of organizations will develop customized models that are personalized to their industry, business, or use case. With a variety of techniques available to build a custom model, organizations of all sizes can develop personalized models to realize more meaningful, specific impact from their AI implementations. The key is to clearly scope the use case, design and implement evaluation systems, choose the right techniques, and be prepared to iterate over time for the model to reach optimal performance.

With OpenAI, most organizations can see meaningful results quickly with the self-serve fine-tuning API. For any organizations that need to more deeply fine-tune their models or imbue new, domain-specific knowledge into the model, our Custom Model programs can help.

Visit our fine-tuning API docs to start fine-tuning our models. For more information on how we can help customize models for your use case, reach out to us.

Start using ChatGPT instantly

OpenAI — Wed, 27 Mar 2024 18:16:27 GMT

It's core to our mission to make tools like ChatGPT broadly available so that people can experience the benefits of AI. More than 100 million people across 185 countries use ChatGPT weekly to learn something new, find creative inspiration, and get answers to their questions. Starting today, you can use ChatGPT instantly, without needing to sign-up. We're rolling this out gradually, with the aim to make AI accessible to anyone curious about its capabilities.

We may use what you provide to ChatGPT to improve our models for everyone. If you’d like, you can turn this off through your Settings - whether you create an account or not. Learn more about how we use content to train our models and your choices in our Help Center.

We’ve also introduced additional content safeguards for this experience, such as blocking prompts and generations in a wider range of categories.

There are many benefits to creating an account including the ability to save and review your chat history, share chats, and unlock additional features like voice conversations and custom instructions.

For anyone that has been curious about AI’s potential but didn’t want to go through the steps to set-up an account, start using ChatGPT today.

Sora: first impressions

OpenAI — Fri, 22 Mar 2024 16:48:46 GMT

Since we introduced Sora to the world last month, we’ve been working with visual artists, designers, creative directors and filmmakers to learn how Sora might aid in their creative process.

Sora is at its most powerful when you’re not replicating the old but bringing to life new and impossible ideas we would have otherwise never had the opportunity to see.

Paul Trillo, Director

While we have many improvements to make to Sora, we're already getting a glimpse of how the model can help creatives bring ideas to reality.

As great as Sora is at generating things that appear real - what excites us is its ability to make things that are totally surreal.

shy kids

Below are a few examples of the artists’ work, with early thoughts from them on how they see Sora fitting into their workflows and businesses.

shy kids – “Air Head”

Based in Toronto, shy kids are a multimedia production company who utilized Sora for their short film about a balloon man. “We now have the ability to expand on stories we once thought impossible,” shares the trio made up of Walter Woodman, Sidney Leeder and Patrick Cederberg. Walter, who directed Air Head, remarks that “as great as Sora is at generating things that appear real, what excites us is its ability to make things that are totally surreal. A new era of abstract expressionism.” Speaking to the wider industry, “people from all over the world with stories ready to burst out of their chests finally have the opportunity to show the world what’s inside.”

Paul Trillo, Director

Paul Trillo is a multi-disciplinary artist, writer, and director whose work has earned accolades from outlets like Rolling Stone and The New Yorker. Paul has garnered 19 Vimeo Staff Picks, an honor given to the best short films hosted on Vimeo. “Working with Sora is the first time I’ve felt unchained as a filmmaker,” he states. “Not restricted by time, money, other people’s permission, I can ideate and experiment in bold and exciting ways.” His experimental videos reflect this approach. “Sora is at its most powerful when you’re not replicating the old but bringing to life new and impossible ideas we would have otherwise never had the opportunity to see.”

Nik Kleverov, Creative Director / Native Foreign

Native Foreign is an Emmy-nominated creative agency from Los Angeles, California specializing in brand storytelling, motion and title design, and generative AI workflows. Co-Founder Nik Kleverov, who is using Sora “to visualize concepts and rapidly iterate on creative for brand partners,” suggests that budgetary restraints no longer have to entirely shape the narrative of creativity. “I’m one of those creatives that thinks in motion, so when I’m in Sora it really feels like I can bring any idea to life.”

August Kamp, Artist/Musician

August Kamp is a musician, researcher, creative activist and multidisciplinary artist. “Sora represents a real turning point for me as an artist whose scope has always been limited by imagination being at odds with means,” she explains. “Being able to build and iterate on cinematic visuals this intuitively has opened up categorically new lanes of artistry to me...I truly cannot wait to see what other forms of storytelling will come into reach with the future of these tools."

Josephine Miller, Creative Director

Josephine Miller is the Co-Founder and Creative Director of London based Oraar Studio, specializing in the design of 3D visuals, augmented reality and digital fashion. "Sora has opened up the potential to bring to life ideas I've had for years, ideas that were previously technically impossible,” she states. “The ability to rapidly conceptualize at such a high level of quality is not only challenging my creative process but also helping me evolve in storytelling. It's enabling me to translate my imagination with fewer technical constraints."

Don Allen Stevenson III, Digital AR/XR Artist

Starting his career at DreamWorks Animation, Don Allen III is a multidisciplinary creator, speaker and consultant who collaborates with major tech and entertainment companies on mixed reality, virtual reality and AI applications. “For a long time I've been making augmented reality hybrid creatures that I think would be fun combinations in my head. Now I have a much easier way of prototyping the ideas before I fully build out the 3-D characters to place in spatial computers.” Don cites Sora’s “weirdness” as its greatest strength: “It’s not bound by traditional laws of physics or conventions of thought.” He says that working with Sora shifted his focus from “technical hurdles to pure creativity…unlocking a world of instant visualization and rapid prototyping.” At the same time, Don says “I feel like this allows me to focus more of my time and energy in the right places… and the emotional impact that I would like my characters to have.”

Alex Reben, Sculptor/Artist and OpenAI’s Artist In Residence

Alexander Reben is an artist who has spent the last decade creating work that explores the humor and absurdity of human nature in artificial intelligence. Alex has been creating sculptures that originate from AI-generated imagery, manually transforming those AI creations into 3D models materialized in the physical world. “My experience of using Sora was as a starting point to develop 3D sculpture. My thoughts drifted towards exploring the realm of photogrammetry and its potential applications to sculpture. The prospect of transforming video into 3D models intrigued me, as it hinted at propelling the AI system beyond its initial scope.”

Global news partnerships: Le Monde and Prisa Media

OpenAI — Wed, 13 Mar 2024 00:21:13 GMT

We are continually making improvements to ChatGPT and are supporting the essential role of the news industry in delivering real-time, authoritative information to users. We’re excited to announce partnerships with Le Monde and Prisa Media along with its publications like El País, Cinco Días, As, and El Huffpost. Our partnerships will enable ChatGPT users to engage with Le Monde and Prisa Media’s high-quality content on recent events in ChatGPT, and their content will also contribute to the training of our models.

Brad Lightcap, COO of OpenAI, said, "We're dedicated to supporting journalism by applying new AI technologies and enhancing opportunities for content creators. In partnership with Le Monde and Prisa Media, our goal is to enable ChatGPT users around the world to connect with the news in new ways that are interactive and insightful.”

In partnership with Le Monde and Prisa Media, our goal is to enable ChatGPT users around the world to connect with the news in new ways that are interactive and insightful.

Brad Lightcap, COO of OpenAI

Over the coming months, ChatGPT users will be able to interact with relevant news content from these publishers through select summaries with attribution and enhanced links to the original articles, giving users the ability to access additional information or related articles from their news sites.

Echoing this sentiment, Louis Dreyfus, CEO of Le Monde, stated, "At the moment we are celebrating the 80th anniversary of Le Monde, this partnership with OpenAI allows us to expand our reach and uphold our commitment to providing accurate, verified, balanced news stories at scale. Collaborating with OpenAI ensures that our authoritative content can be accessed and appreciated by a broader, more diverse audience.

Every shift in the media landscape has presented Le Monde with new opportunities. From the transition to digital platforms to embracing the era of free media, Le Monde has consistently seized these moments to underscore its commitment to independence, expertise, and journalistic integrity.

Since 2010, Le Monde has emerged as a digital media trailblazer, adapting its organizational structure and operational methods while steadfastly adhering to its core principles. By 2024, Le Monde has established itself as France's leading news outlet, boasting more than 600,000 subscribers, 2.2M unique users a day and generating over 632 million page views per month.

Our partnership with OpenAI is a strategic move to ensure the dissemination of reliable information to AI users, safeguarding our journalistic integrity and revenue streams in the process.”

Carlos Nuñez, Chairman and CEO of Prisa Media added, “Joining forces with OpenAI opens new avenues for us to engage with our audience. Leveraging ChatGPT's capabilities allows us to present our in-depth, quality journalism in novel ways, reaching individuals who seek credible and independent content. This is a definite step towards the future of news, where technology and human expertise merge to enrich the reader's experience.

This is a new chapter in Prisa Media’s digital journey, where we are continuously improving our position as the largest Hispanic mediahouse, operating the leading media brands in our core markets: Spain, Latam and USA. We have developed a reach of more than 7 million daily unique users with over 1,650 million page views per month and a clear focus on developing content in digital formats beyond text, both in audio, where we provide 90 million total listening hours and 51 million audio downloads per month, and in video, with more than 141 million monthly video views.”

Our partnerships with Le Monde and Prisa Media, as well as Axel Springer, help empower news organizations to reach audiences in new ways. They build on our collaborations with American Journalism Project to support innovative local news initiatives, and The Associated Press, which contributes to the training of our models. Our partnerships underscore our vision to develop advanced AI tools that empower industries, such as journalism, and solve problems that are otherwise out of reach.

OpenAI announces new members to board of directors

OpenAI — Fri, 08 Mar 2024 19:46:26 GMT

We’re announcing three new members to our Board of Directors as a first step towards our commitment to expansion: Dr. Sue Desmond-Hellmann, former CEO of the Bill and Melinda Gates Foundation, Nicole Seligman, former EVP and General Counsel at Sony Corporation and Fidji Simo, CEO and Chair of Instacart. Additionally, Sam Altman, CEO, will rejoin the OpenAI Board of Directors.

Sue, Nicole and Fidji have experience in leading global organizations and navigating complex regulatory environments, including backgrounds in technology, nonprofit and board governance. They will work closely with current board members Adam D’Angelo, Larry Summers and Bret Taylor as well as Sam and OpenAI’s senior management.

Bret Taylor, Chair of the OpenAI board, stated, “I am excited to welcome Sue, Nicole, and Fidji to the OpenAI Board of Directors. Their experience and leadership will enable the Board to oversee OpenAI’s growth, and to ensure that we pursue OpenAI’s mission of ensuring artificial general intelligence benefits all of humanity.”

Dr. Sue Desmond-Hellmann is a non-profit leader and physician. Dr. Desmond-Hellmann currently serves on the Boards of Pfizer and the President’s Council of Advisors on Science and Technology. She previously was a Director at Proctor and Gamble, Meta (Facebook), and the Bill & Melinda Gates Medical Research institute. She served as the Chief Executive Officer of the Bill & Melinda Gates Foundation from 2014 to 2020. From 2009-2014 she was Professor and Chancellor of the University of California, San Francisco (UCSF), the first woman to hold the position. She also previously served as President of Product Development at Genentech, where she played a leadership role in the development of the first gene-targeted cancer drugs.

Nicole Seligman is a globally recognized corporate and civic leader and lawyer. She currently serves on three public company corporate boards - Paramount Global, MeiraGTx Holdings PLC, and Intuitive Machines, Inc. Seligman held several senior leadership positions at Sony entities, including EVP and General Counsel at Sony Corporation, where she oversaw functions including global legal and compliance matters. She also served as President of Sony Entertainment, Inc., and simultaneously served as President of Sony Corporation of America. Seligman also currently holds nonprofit leadership roles at the Schwarzman Animal Medical Center and The Doe Fund in New York City. Previously, Seligman was a partner in the litigation practice at Williams & Connolly LLP in Washington, D.C., working on complex civil and criminal matters and counseling a wide range of clients, including President William Jefferson Clinton and Hillary Clinton. She served as a law clerk to Justice Thurgood Marshall on the Supreme Court of the United States.

Fidji Simo is a consumer technology industry veteran, having spent more than 15 years leading the operations, strategy and product development for some of the world’s leading businesses. She is the Chief Executive Officer and Chair of Instacart. She also serves as a member of the Board of Directors at Shopify. Prior to joining Instacart, Simo was Vice President and Head of the Facebook App. Over the last decade at Facebook, she oversaw the Facebook App, including News Feed, Stories, Groups, Video, Marketplace, Gaming, News, Dating, Ads and more. Simo founded the Metrodora Institute, a multidisciplinary medical clinic and research foundation dedicated to the care and cure of neuroimmune axis disorders and serves as President of the Metrodora Foundation.

Review completed & Altman, Brockman to continue to lead OpenAI

OpenAI — Fri, 08 Mar 2024 19:46:19 GMT

The Special Committee of the OpenAI Board today announced the completion of the review by WilmerHale. The firm conducted dozens of interviews with members of OpenAI’s prior Board, OpenAI executives, advisors to the prior Board, and other pertinent witnesses; reviewed more than 30,000 documents; and evaluated various corporate actions. Based on the record developed by WilmerHale and following the recommendation of the Special Committee, the Board expressed its full confidence in Mr. Sam Altman and Mr. Greg Brockman’s ongoing leadership of OpenAI.

“We have unanimously concluded that Sam and Greg are the right leaders for OpenAI,” stated Bret Taylor, Chair of the OpenAI Board.

Sam Altman, as CEO, will rejoin the OpenAI Board of Directors.

The OpenAI Board also announced today the election of three new Board members as one part of its commitment to expansion, including:

Dr. Sue Desmond-Hellmann, former CEO of the Bill and Melinda Gates Foundation and on the Board of Directors at Pfizer and on the President’s Council of Advisors on Science and Technology.
Nicole Seligman, former EVP and Global General Counsel of Sony and President of Sony Entertainment and on the Board of Directors at Paramount Global, Meira GTx, and Intuitive Machines, Inc.
Fidji Simo, CEO and Chair of Instacart and on the Board of Directors at Shopify

The new members have experience in leading global organizations and navigating complex regulatory environments, including backgrounds in technology, nonprofit and board governance. They will work closely with current board members Adam D’Angelo, Larry Summers and Bret Taylor as well as Greg, Sam, and OpenAI’s senior management.

Taylor further stated, “As Chair of the Board, I am excited to welcome Sue, Nicole, and Fidji to the OpenAI Board of Directors. Their experience and leadership will enable the Board to oversee OpenAI’s growth and to ensure that we pursue OpenAI’s mission of ensuring artificial general intelligence benefits all of humanity.”

The Board also announced the adoption of important improvements to OpenAI’s governance structure. Key enhancements include:

Adopting a new set of corporate governance guidelines;
Strengthening OpenAI’s Conflict of Interest Policy;
Creating a whistleblower hotline to serve as an anonymous reporting resource for all OpenAI employees and contractors; and
Creating additional Board committees, including a Mission & Strategy committee focused on implementation and advancement of the core mission of OpenAI.

The expanded board will prioritize its crucial work to enhance the governance procedures to best achieve OpenAI’s mission. “We recognize the magnitude of our role in stewarding transformative technologies for the global good,” added Taylor.

The Special Committee acknowledged the important work done by WilmerHale in conducting this extensive review and thanked OpenAI current and former Board members, advisors and employees for their cooperation. The Special Committee of OpenAI’s Board of Directors released a summary of findings.

Summary of WilmerHale review & findings

On December 8, 2023, the Special Committee retained WilmerHale to conduct a review of the events concerning the November 17, 2023 removal of Sam Altman and Greg Brockman from the OpenAI Board of Directors and Mr. Altman’s termination as CEO. WilmerHale reviewed more than 30,000 documents; conducted dozens of interviews, including of members of OpenAI’s prior Board, OpenAI executives, advisors to the prior Board, and other pertinent witnesses; and evaluated various corporate actions.

The Special Committee provided WilmerHale with the resources and authority necessary to conduct a comprehensive review. Many OpenAI employees, as well as current and former Board members, cooperated with the review process. WilmerHale briefed the Special Committee several times on the progress and conclusions of the review.

WilmerHale evaluated management and governance issues that had been brought to the prior Board’s attention, as well as additional issues that WilmerHale identified in the course of its review. WilmerHale found there was a breakdown in trust between the prior Board and Mr. Altman that precipitated the events of November 17.

WilmerHale reviewed the public post issued by the prior Board on November 17 and concluded that the statement accurately recounted the prior Board’s decision and rationales. WilmerHale found that the prior Board believed at the time that its actions would mitigate internal management challenges and did not anticipate that its actions would destabilize the Company. WilmerHale also found that the prior Board’s decision did not arise out of concerns regarding product safety or security, the pace of development, OpenAI’s finances, or its statements to investors, customers, or business partners. Instead, it was a consequence of a breakdown in the relationship and loss of trust between the prior Board and Mr. Altman. WilmerHale found the prior Board implemented its decision on an abridged timeframe, without advance notice to key stakeholders, and without a full inquiry or an opportunity for Mr. Altman to address the prior Board’s concerns. WilmerHale found that the prior Board acted within its broad discretion to terminate Mr. Altman, but also found that his conduct did not mandate removal.

After reviewing the WilmerHale findings, the Special Committee recommended to the full Board that it endorse the November 21 decision to rehire Mr. Altman and Mr. Brockman. With knowledge of the review’s findings, the Special Committee expressed its full confidence in Mr. Altman and Mr. Brockman’s ongoing leadership of OpenAI.

The Special Committee is pleased to conclude this review and looks forward to continuing with the important work of OpenAI.

Navigating the Challenges and Opportunities of Synthetic Voices

OpenAI — Wed, 06 Mar 2024 15:50:42 GMT

OpenAI is committed to developing safe and broadly beneficial AI. Today we are sharing preliminary insights and results from a small-scale preview of a model called Voice Engine, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker. It is notable that a small model with a single 15-second sample can create emotive and realistic voices.

We first developed Voice Engine in late 2022, and have used it to power the preset voices available in the text-to-speech API as well as ChatGPT Voice and Read Aloud. At the same time, we are taking a cautious and informed approach to a broader release due to the potential for synthetic voice misuse. We hope to start a dialogue on the responsible deployment of synthetic voices, and how society can adapt to these new capabilities. Based on these conversations and the results of these small scale tests, we will make a more informed decision about whether and how to deploy this technology at scale.

Early applications of Voice Engine

To better understand the potential uses of this technology, late last year we started privately testing it with a small group of trusted partners. We've been impressed by the applications this group has developed. These small scale deployments are helping to inform our approach, safeguards, and thinking about how Voice Engine could be used for good across various industries. A few early examples include:

Providing reading assistance to non-readers and children through natural-sounding, emotive voices representing a wider range of speakers than what's possible with preset voices. Age of Learning, an education technology company dedicated to the academic success of children, has been using this to generate pre-scripted voice-over content. They also use Voice Engine and GPT-4 to create real-time, personalized responses to interact with students. With this technology, Age of Learning has been able to create more content for a wider audience.

1. Reference audio

2. Generated audio

Some of the most amazing habitats on Earth are found in the rainforest. A rainforest is a place with a lot of precipitation and it has many kinds of animals trees and other plants. Tropical rainforests are usually not too far from the equator and are warm all year.

Translating content, like videos and podcasts, so creators and businesses can reach more people around the world, fluently and in their own voices. One early adopter of this is HeyGen, an AI visual storytelling platform that works with their enterprise customers to create custom, human-like avatars for a variety of content, from product marketing to sales demos. They use Voice Engine for video translation, so they can translate a speaker's voice into multiple languages and reach a global audience. When used for translation, Voice Engine preserves the native accent of the original speaker: for example generating English with an audio sample from a French speaker would produce speech with a French accent.

1. Reference audio

2. Generated audio

La amistad es un tesoro universal aporta alegría apoyo y risas a nuestras vidas sin importar donde estemos en el mundo. Los verdaderos amigos están con nosotros en las buenas y en las malas compartiendo nuestras alegrías y aliviando nuestras penas. Celebremos los lazos de amistad que nos conectan a todos a través de cada idioma y cultura.

Reaching global communities, by improving essential service delivery in remote settings. Dimagi is building tools for community health workers to provide a variety of essential services, such as counseling for breastfeeding mothers. To help these workers develop their skills, Dimagi uses Voice Engine and GPT-4 to give interactive feedback in each worker's primary language including Swahili or more informal languages like Sheng, a code-mixed language popular in Kenya.

1. Reference audio

2. Generated audio

Lishe bora ni muhimu katika kuhakikisha kwamba watoto wanakua vizuri, kimwili na kiakili. Vyakula kama matunda, mboga, protini, kalsiamu, na vitamini mbalibali ni muhimu sana kwa ukuaji wa mifupa na maendeleo ya ubongo. Kula vizuri kunamaanisha kwamba mtoto anakuwa na mfumu wa kinga imara unaomwezesha kupambana na magonjwa. Hii ina maana kwamba, hata kama kuna mafua yanayoenea mtaani, mtoto atakuwa na uwezo mkubwa wa kukabiliana nayo. Hivyo, hakutakuwa na haja ya kumpeleka hospitalini mara kwa mara. Kwa kufanya hivyo, tunakuwa tunajenga kizazi cha watu imara. Kama unavyojua, mustakabali wa jamii yetu uko mikononi mwa vijana hawa. Ni vyema tuwape mwanzo bora maishani.

Supporting people who are non-verbal, such as therapeutic applications for individuals with conditions that affect speech and educational enhancements for those with learning needs. Livox, an AI alternative communication app, powers Augmentative & Alternative Communication (AAC) devices that enable people with disabilities to communicate. By using Voice Engine, they are able to offer people who are non-verbal unique and non-robotic voices across many languages. Their users can choose speech that best represents them, and for multilingual users, maintain a consistent voice across each spoken language.

1. Reference audio

2. Generated audio

Excuse me can I get your attention? Thank you for your help. Can we watch a movie tonight? Could you please help me find my glasses? Thank you for your understanding, it means a lot to me.

Helping patients recover their voice, for those suffering from sudden or degenerative speech conditions. The Norman Prince Neurosciences Institute at Lifespan, a not-for-profit health system that serves as the primary teaching affiliate of Brown University's medical school, is exploring uses of AI in clinical contexts. They've been piloting a program offering Voice Engine to individuals with oncologic or neurologic etiologies for speech impairment. Since Voice Engine requires such a short audio sample, doctors Fatima Mirza, Rohaid Ali and Konstantina Svokos were able to restore the voice of a young patient who lost her fluent speech due to a vascular brain tumor, using audio from a video recorded for a school project.

1. Current voice

2. Reference audio

3. Generated audio

Hi everyone, this is what my voice sounds like using OpenAI's new text to speech model called Voice Engine. I was able to use just 15 seconds of a video that I made for a class project to be the reference audio source for the voice you hear right now. What do you think?

Building Voice Engine safely

We recognize that generating speech that resembles people's voices has serious risks, which are especially top of mind in an election year. We are engaging with U.S. and international partners from across government, media, entertainment, education, civil society and beyond to ensure we are incorporating their feedback as we build.

The partners testing Voice Engine today have agreed to our usage policies, which prohibit the impersonation of another individual or organization without consent or legal right. In addition, our terms with these partners require explicit and informed consent from the original speaker and we don’t allow developers to build ways for individual users to create their own voices. Partners must also clearly disclose to their audience that the voices they're hearing are AI-generated. Finally, we have implemented a set of safety measures, including watermarking to trace the origin of any audio generated by Voice Engine, as well as proactive monitoring of how it's being used.

We believe that any broad deployment of synthetic voice technology should be accompanied by voice authentication experiences that verify that the original speaker is knowingly adding their voice to the service and a no-go voice list that detects and prevents the creation of voices that are too similar to prominent figures.

Looking ahead

Voice Engine is a continuation of our commitment to understand the technical frontier and openly share what is becoming possible with AI. In line with our approach to AI safety and our voluntary commitments, we are choosing to preview but not widely release this technology at this time. We hope this preview of Voice Engine both underscores its potential and also motivates the need to bolster societal resilience against the challenges brought by ever more convincing generative models. Specifically, we encourage steps like:

Phasing out voice based authentication as a security measure for accessing bank accounts and other sensitive information
Exploring policies to protect the use of individuals' voices in AI
Educating the public in understanding the capabilities and limitations of AI technologies, including the possibility of deceptive AI content
Accelerating the development and adoption of techniques for tracking the origin of audiovisual content, so it's always clear when you're interacting with a real person or with an AI

It's important that people around the world understand where this technology is headed, whether we ultimately deploy it widely ourselves or not. We look forward to continuing to engage in conversations around the challenges and opportunities of synthetic voices with policymakers, researchers, developers and creatives.

OpenAI and Elon Musk

Tue, 05 Mar 2024 16:35:33 GMT

The mission of OpenAI is to ensure AGI benefits all of humanity, which means both building safe and beneficial AGI and helping create broadly distributed benefits. We are now sharing what we've learned about achieving our mission, and some facts about our relationship with Elon.

Update March 27, 2024: We have filed papers seeking dismissal of all of Elon's claims.

We realized building AGI will require far more resources than we’d initially imagined

Elon said we should announce an initial $1B funding commitment to OpenAI. In total, the non-profit has raised less than $45M from Elon and more than $90M from other donors.

When starting OpenAI in late 2015, Greg and Sam had initially planned to raise $100M. Elon said in an email: “We need to go with a much bigger number than $100M to avoid sounding hopeless… I think we should say that we are starting with a $1B funding commitment… I will cover whatever anyone else doesn't provide.” [1]

We spent a lot of time trying to envision a plausible path to AGI. In early 2017, we came to the realization that building AGI will require vast quantities of compute. We began calculating how much compute an AGI might plausibly require. We all understood we were going to need a lot more capital to succeed at our mission—billions of dollars per year, which was far more than any of us, especially Elon, thought we’d be able to raise as the non-profit.

We and Elon recognized a for-profit entity would be necessary to acquire those resources

As we discussed a for-profit structure in order to further the mission, Elon wanted us to merge with Tesla or he wanted full control. Elon left OpenAI, saying there needed to be a relevant competitor to Google/DeepMind and that he was going to do it himself. He said he’d be supportive of us finding our own path.

In late 2017, we and Elon decided the next step for the mission was to create a for-profit entity. Elon wanted majority equity, initial board control, and to be CEO. In the middle of these discussions, he withheld funding. Reid Hoffman bridged the gap to cover salaries and operations.

We couldn’t agree to terms on a for-profit with Elon because we felt it was against the mission for any individual to have absolute control over OpenAI. He then suggested instead merging OpenAI into Tesla. In early February 2018, Elon forwarded us an email suggesting that OpenAI should “attach to Tesla as its cash cow”, commenting that it was “exactly right… Tesla is the only path that could even hope to hold a candle to Google. Even then, the probability of being a counterweight to Google is small. It just isn’t zero”. [2]

Elon soon chose to leave OpenAI, saying that our probability of success was 0, and that he planned to build an AGI competitor within Tesla. When he left in late February 2018, he told our team he was supportive of us finding our own path to raising billions of dollars. In December 2018, Elon sent us an email saying “Even raising several hundred million won’t be enough. This needs billions per year immediately or forget it.” [3]

We advance our mission by building widely-available beneficial tools

We’re making our technology broadly usable in ways that empower people and improve their daily lives, including via open-source contributions.

We provide broad access to today's most powerful AI, including a free version that hundreds of millions of people use every day. For example, Albania is using OpenAI’s tools to accelerate its EU accession by as much as 5.5 years; Digital Green is helping boost farmer income in Kenya and India by dropping the cost of agricultural extension services 100x by building on OpenAI; Lifespan, the largest healthcare provider in Rhode Island, uses GPT-4 to simplify its surgical consent forms from a college reading level to a 6th grade one; Iceland is using GPT-4 to preserve the Icelandic language.

Elon understood the mission did not imply open-sourcing AGI. As Ilya told Elon: “As we get closer to building AI, it will make sense to start being less open. The Open in openAI means that everyone should benefit from the fruits of AI after its built, but it's totally OK to not share the science...”, to which Elon replied: “Yup”. [4]

We're sad that it's come to this with someone whom we’ve deeply admired—someone who inspired us to aim higher, then told us we would fail, started a competitor, and then sued us when we started making meaningful progress towards OpenAI’s mission without him.

We are focused on advancing our mission and have a long way to go. As we continue to make our tools better and better, we are excited to deploy these systems so they empower every individual.

Update March 11, 2024: We are seeking to have the lawsuit assigned to dedicated case management, since it involves AI technology and the claims span almost a decade.

[1]

From:

Elon Musk <

To:

Greg Brockman <

CC:

Sam Altman <

Date: Sun, Nov 22, 2015 at 7:48 PM

Subject: follow up from call

Blog sounds good, assuming adjustments for neutrality vs being YC-centric.

I'd favor positioning the blog to appeal a bit more to the general public -- there is a lot of value to having the public root for us to succeed -- and then having a longer, more detailed and inside-baseball version for recruiting, with a link to it at the end of the general public version.

We need to go with a much bigger number than $100M to avoid sounding hopeless relative to what Google or Facebook are spending. I think we should say that we are starting with a $1B funding commitment. This is real. I will cover whatever anyone else doesn't provide.

Template seems fine, apart from shifting to a vesting cash bonus as default, which can optionally be turned into YC or potentially SpaceX (need to understand how much this will be) stock.

[2]

From:

Elon Musk <

To:

Ilya Sutskever <

>, Greg Brockman <

Date: Thu, Feb 1, 2018 at 3:52 AM

Subject: Fwd: Top AI institutions today

is exactly right. We may wish it otherwise, but, in my and

’s opinion, Tesla is the only path that could even hope to hold a candle to Google. Even then, the probability of being a counterweight to Google is small. It just isn't zero.

Begin forwarded message:

From:

To:

Elon Musk <

Date: January 31, 2018 at 11:54:30 PM PST

Subject: Re: Top AI institutions today

Working at the cutting edge of AI is unfortunately expensive. For example,

In addition to DeepMind, Google also has Google Brain, Research, and Cloud. And TensorFlow, TPUs, and they own about a third of all research (in fact, they hold their own AI conferences).

I also strongly suspect that compute horsepower will be necessary (and possibly even sufficient) to reach AGI. If historical trends are any indication, progress in AI is primarily driven by systems - compute, data, infrastructure. The core algorithms we use today have remained largely unchanged from the ~90s. Not only that, but any algorithmic advances published in a paper somewhere can be almost immediately re-implemented and incorporated. Conversely, algorithmic advances alone are inert without the scale to also make them scary.

It seems to me that OpenAI today is burning cash and that the funding model cannot reach the scale to seriously compete with Google (an 800B company). If you can't seriously compete but continue to do research in open, you might in fact be making things worse and helping them out “for free”, because any advances are fairly easy for them to copy and immediately incorporate, at scale.

A for-profit pivot might create a more sustainable revenue stream over time and would, with the current team, likely bring in a lot of investment. However, building out a product from scratch would steal focus from AI research, it would take a long time and it's unclear if a company could “catch up” to Google scale, and the investors might exert too much pressure in the wrong directions.The most promising option I can think of, as I mentioned earlier, would be for OpenAI to attach to Tesla as its cash cow. I believe attachments to other large suspects (e.g. Apple? Amazon?) would fail due to an incompatible company DNA. Using a rocket analogy, Tesla already built the “first stage” of the rocket with the whole supply chain of Model 3 and its onboard computer and a persistent internet connection. The “second stage” would be a full self driving solution based on large-scale neural network training, which OpenAI expertise could significantly help accelerate. With a functioning full self-driving solution in ~2-3 years we could sell a lot of cars/trucks. If we do this really well, the transportation industry is large enough that we could increase Tesla's market cap to high O(~100K), and use that revenue to fund the AI work at the appropriate scale.

I cannot see anything else that has the potential to reach sustainable Google-scale capital within a decade.

[3]

From:

Elon Musk <

To:

Ilya Sutskever <

>, Greg Brockman <

CC:

Sam Altman <

Date: Wed, Dec 26, 2018 at 12:07 PM

Subject: I feel I should reiterate

My probability assessment of OpenAI being relevant to DeepMind/Google without a dramatic change in execution and resources is 0%. Not 1%. I wish it were otherwise.

Even raising several hundred million won't be enough. This needs billions per year immediately or forget it.

Unfortunately, humanity's future is in the hands of

And they are doing a lot more than this.

I really hope I'm wrong.

Elon

[4]

Fwd: congrats on the falcon 93 messages

From:

Elon Musk <

To:

Sam Altman <

>, Ilya Sutskever <

>, Greg Brockman <

Date: Sat, Jan 2, 2016 at 8:18 AM

Subject: Fwd: congrats on the falcon 9

Begin forwarded message:

From:

To:

Elon Musk <

Date: January 2, 2016 at 10:12:32 AM CST

Subject: congrats on the falcon 9

Hi Elon

Happy new year to you,

!

Congratulations on landing the Falcon 9, what an amazing achievement. Time to build out the fleet now!

I've seen you (and Sam and other OpenAI people) doing a lot of interviews recently extolling the virtues of open sourcing AI, but I presume you realise that this is not some sort of panacea that will somehow magically solve the safety problem? There are many good arguments as to why the approach you are taking is actually very dangerous and in fact may increase the risk to the world. Some of the more obvious points are well articulated in this blog post, that I'm sure you've seen, but there are also other important considerations:
http://slatestarcodex.com/2015/12/17/should-ai-be-open/

I’d be interested to hear your counter-arguments to these points.

Best

From:

Ilya Sutskever <

To:

Elon Musk <

>, Sam Altman <

>, Greg Brockman <

Date: Sat, Jan 2, 2016 at 9:06 AM

Subject: Fwd: congrats on the falcon 9

The article is concerned with a hard takeoff scenario: if a hard takeoff occurs, and a safe AI is harder to build than an unsafe one, then by opensorucing everything, we make it easy for someone unscrupulous with access to overwhelming amount of hardware to build an unsafe AI, which will experience a hard takeoff.

As we get closer to building AI, it will make sense to start being less open. The Open in openAI means that everyone should benefit from the fruits of AI after its built, but it's totally OK to not share the science (even though sharing everything is definitely the right strategy in the short and possibly medium term for recruitment purposes).

From:

Elon Musk <

To:

Ilya Sutskever <

Date: Sat, Jan 2, 2016 at 9:11 AM

Subject: Fwd: congrats on the falcon 9

Yup

Disrupting malicious uses of AI by state-affiliated threat actors

OpenAI — Tue, 13 Feb 2024 22:06:03 GMT

We build AI tools that improve lives and help solve complex challenges, but we know that malicious actors will sometimes try to abuse our tools to harm others, including in furtherance of cyber operations. Among those malicious actors, state-affiliated groups—which may have access to advanced technology, large financial resources, and skilled personnel—can pose unique risks to the digital ecosystem and human welfare.

In partnership with Microsoft Threat Intelligence, we have disrupted five state-affiliated actors that sought to use AI services in support of malicious cyber activities. We also outline our approach to detect and disrupt such actors in order to promote information sharing and transparency regarding their activities.

Disruption of threat actors

Based on collaboration and information sharing with Microsoft, we disrupted five state-affiliated malicious actors: two China-affiliated threat actors known as Charcoal Typhoon and Salmon Typhoon; the Iran-affiliated threat actor known as Crimson Sandstorm; the North Korea-affiliated actor known as Emerald Sleet; and the Russia-affiliated actor known as Forest Blizzard. The identified OpenAI accounts associated with these actors were terminated.

These actors generally sought to use OpenAI services for querying open-source information, translating, finding coding errors, and running basic coding tasks.

Specifically:

Charcoal Typhoon used our services to research various companies and cybersecurity tools, debug code and generate scripts, and create content likely for use in phishing campaigns.
Salmon Typhoon used our services to translate technical papers, retrieve publicly available information on multiple intelligence agencies and regional threat actors, assist with coding, and research common ways processes could be hidden on a system.
Crimson Sandstorm used our services for scripting support related to app and web development, generating content likely for spear-phishing campaigns, and researching common ways malware could evade detection.
Emerald Sleet used our services to identify experts and organizations focused on defense issues in the Asia-Pacific region, understand publicly available vulnerabilities, help with basic scripting tasks, and draft content that could be used in phishing campaigns.
Forest Blizzard used our services primarily for open-source research into satellite communication protocols and radar imaging technology, as well as for support with scripting tasks.

Additional technical details on the nature of the threat actors and their activities can be found in the Microsoft blog post published today.

The activities of these actors are consistent with previous red team assessments we conducted in partnership with external cybersecurity experts, which found that GPT-4 offers only limited, incremental capabilities for malicious cybersecurity tasks beyond what is already achievable with publicly available, non-AI powered tools.

A multi-pronged approach to AI safety

Although the capabilities of our current models for malicious cybersecurity tasks are limited, we believe it’s important to stay ahead of significant and evolving threats. To respond to the threat, we are taking a multi-pronged approach to combating malicious state-affiliated actors’ use of our platform:

Monitoring and disrupting malicious state affiliated actors. We invest in technology and teams to identify and disrupt sophisticated threat actors’ activities. Our Intelligence and Investigations, Safety, Security, and Integrity teams investigate malicious actors in a variety of ways, including using our models to pursue leads, analyze how adversaries are interacting with our platform, and assess their broader intentions. Upon detection, OpenAI takes appropriate action to disrupt their activities, such as disabling their accounts, terminating services, or limiting access to resources.
Working together with the AI ecosystem. OpenAI collaborates with industry partners and other stakeholders to regularly exchange information about malicious state-affiliated actors’ detected use of AI. This collaboration reflects our voluntary commitment to promote the safe, secure and transparent development and use of AI technology, and aims to promote collective responses to ecosystem-wide risks via information sharing.
Iterating on safety mitigations. Learning from real-world use (and misuse) is a key component of creating and releasing increasingly safe AI systems over time. We take lessons learned from these actors' abuse and use them to inform our iterative approach to safety. Understanding how the most sophisticated malicious actors seek to use our systems for harm gives us a signal into practices that may become more widespread in the future, and allows us to continuously evolve our safeguards.
Public transparency. We have long sought to highlight potential misuses of AI [link 1, link 2] and share what we have learned about safety [link 1, link 2] with the industry and the public. As part of our ongoing efforts to advance responsible use of AI, OpenAI will continue to inform the public and stakeholders about the nature and extent of malicious state-affiliated actors’ use of AI detected within our systems and the measures taken against them, when warranted. We believe that sharing and transparency foster greater awareness and preparedness among all stakeholders, leading to stronger collective defense against ever-evolving adversaries.

The vast majority of people use our systems to help improve their daily lives, from virtual tutors for students to apps that can transcribe the world for people who are seeing impaired. As is the case with many other ecosystems, there are a handful of malicious actors that require sustained attention so that everyone else can continue to enjoy the benefits. Although we work to minimize potential misuse by such actors, we will not be able to stop every instance. But by continuing to innovate, investigate, collaborate, and share, we make it harder for malicious actors to remain undetected across the digital ecosystem and improve the experience for everyone else.

Memory and new controls for ChatGPT

OpenAI — Mon, 12 Feb 2024 17:52:14 GMT

We’re testing memory with ChatGPT. Remembering things you discuss across all chats saves you from having to repeat information and makes future conversations more helpful.

You’re in control of ChatGPT’s memory. You can explicitly tell it to remember something, ask it what it remembers, and tell it to forget conversationally or through settings. You can also turn it off entirely.

We are rolling out to a small portion of ChatGPT free and Plus users this week to learn how useful it is. We will share plans for broader roll out soon.

How memory works

As you chat with ChatGPT, you can ask it to remember something specific or let it pick up details itself. ChatGPT’s memory will get better the more you use it and you'll start to notice the improvements over time. For example:

You’ve explained that you prefer meeting notes to have headlines, bullets and action items summarized at the bottom. ChatGPT remembers this and recaps meetings this way.
You’ve told ChatGPT you own a neighborhood coffee shop. When brainstorming messaging for a social post celebrating a new location, ChatGPT knows where to start.
You mention that you have a toddler and that she loves jellyfish. When you ask ChatGPT to help create her birthday card, it suggests a jellyfish wearing a party hat.
As a kindergarten teacher with 25 students, you prefer 50-minute lessons with follow-up activities. ChatGPT remembers this when helping you create lesson plans.

You’re in control

You can turn off memory at any time (Settings > Personalization > Memory). While memory is off, you won't create or use memories.

If you want ChatGPT to forget something, just tell it. You can also view and delete specific memories or clear all memories in settings (Settings > Personalization > Manage Memory). ChatGPT's memories evolve with your interactions and aren't linked to specific conversations. Deleting a chat doesn't erase its memories; you must delete the memory itself. You can find more details in our Help Center.

We may use content that you provide to ChatGPT, including memories, to improve our models for everyone. If you’d like, you can turn this off through your Data Controls. As always, we won't train on content from ChatGPT Team and Enterprise customers. Learn more about how we use content to train our models and your choices in our Help Center.

Use temporary chat for conversations without memory

If you’d like to have a conversation without using memory, use temporary chat. Temporary chats won't appear in history, won't use memory, and won't be used to train our models. Learn more about temporary chats in our Help Center.

Custom instructions also allow ChatGPT to be more helpful

Custom Instructions continue to allow you to provide ChatGPT with direct guidance on what you’d like it to know about you and how you’d like it to respond. For explicit information or instructions, you can add it to your Custom Instructions. For information shared via conversations, ChatGPT can remember relevant details for you.

Evolving our privacy and safety standards

Memory brings additional privacy and safety considerations, such as what type of information should be remembered and how it’s used. We’re taking steps to assess and mitigate biases, and steer ChatGPT away from proactively remembering sensitive information, like your health details - unless you explicitly ask it to.

Team and Enterprise customers can work more efficiently

For Enterprise and Team users, memory can be useful when using ChatGPT for work. It can learn your style and preferences, and build upon past interactions. This saves you time and leads to more relevant and insightful responses. For example:

ChatGPT can remember your tone, voice, and format preferences, and automatically apply them to blog post drafts without needing repetition.
When coding, you tell ChatGPT your programming language and frameworks. It can remember these preferences for subsequent tasks, streamlining the process.
For monthly business reviews, you securely upload your data to ChatGPT and it creates your preferred charts with three takeaways each.

As with any ChatGPT feature, you’re in control of your organization’s data. Memories and any other information on your workspace are excluded from training our models. Users have control on how and when their memories are used in chats. In addition, Enterprise account owners can turn memory off for their organization at any time.

Enterprise and Team users will have access to memory as part of our wider rollout.

GPTs will also have memory

GPTs will have their own distinct memory. Builders will have the option to enable memory for their GPTs. Like your chats, memories are not shared with builders. To interact with a memory-enabled GPT, you will also need to have memory on. For example:

The Books GPT helps you find your next read. With memory enabled, it remembers your preferences, such as favorite genres or top books, and tailors recommendations accordingly, without needing repeated inputs.

Each GPT has its own memory, so you might need to repeat details you’ve previously shared with ChatGPT. For example:

If you're using the Artful Greeting Card GPT to create a birthday card for your daughter, it won’t know her age or that she loves jellyfish. You’ll need to tell it the relevant details.

Memory for GPTs will be available when we roll it out more broadly.

New embedding models and API updates

OpenAI — Tue, 23 Jan 2024 16:46:06 GMT

We are releasing new models, reducing prices for GPT-3.5 Turbo, and introducing new ways for developers to manage API keys and understand API usage. The new models include:

Two new embedding models
An updated GPT-4 Turbo preview model
An updated GPT-3.5 Turbo model
An updated text moderation model

By default, data sent to the OpenAI API will not be used to train or improve OpenAI models.

New embedding models with lower pricing

We are introducing two new embedding models: a smaller and highly efficient text-embedding-3-small model, and a larger and more powerful text-embedding-3-large model.

An embedding is a sequence of numbers that represents the concepts within content such as natural language or code. Embeddings make it easy for machine learning models and other algorithms to understand the relationships between content and to perform tasks like clustering or retrieval. They power applications like knowledge retrieval in both ChatGPT and the Assistants API, and many retrieval augmented generation (RAG) developer tools.

A new small text embedding model

text-embedding-3-small is our new highly efficient embedding model and provides a significant upgrade over its predecessor, the text-embedding-ada-002 model released in December 2022.

Stronger performance. Comparing text-embedding-ada-002 to text-embedding-3-small, the average score on a commonly used benchmark for multi-language retrieval (MIRACL) has increased from 31.4% to 44.0%, while the average score on a commonly used benchmark for English tasks (MTEB) has increased from 61.0% to 62.3%.

Reduced price. text-embedding-3-small is also substantially more efficient than our previous generation text-embedding-ada-002 model. Pricing for text-embedding-3-small has therefore been reduced by 5X compared to text-embedding-ada-002, from a price per 1k tokens of $0.0001 to $0.00002.

We are not deprecating text-embedding-ada-002, so while we recommend the newer model, customers are welcome to continue using the previous generation model.

A new large text embedding model: text-embedding-3-large

text-embedding-3-large is our new next generation larger embedding model and creates embeddings with up to 3072 dimensions.

Stronger performance. text-embedding-3-large is our new best performing model. Comparing text-embedding-ada-002 to text-embedding-3-large: on MIRACL, the average score has increased from 31.4% to 54.9%, while on MTEB, the average score has increased from 61.0% to 64.6%.

Eval benchmark	ada v2	text-embedding-3-small	text-embedding-3-large
MIRACL average	31.4	44.0	54.9
MTEB average	61.0	62.3	64.6

text-embedding-3-large will be priced at $0.00013 / 1k tokens.

You can learn more about using the new embedding models in our Embeddings guide.

Native support for shortening embeddings

Using larger embeddings, for example storing them in a vector store for retrieval, generally costs more and consumes more compute, memory and storage than using smaller embeddings.

Both of our new embedding models were trained with a technique^{[^footnote-technique]} that allows developers to trade-off performance and cost of using embeddings. Specifically, developers can shorten embeddings (i.e. remove some numbers from the end of the sequence) without the embedding losing its concept-representing properties by passing in the dimensions API parameter. For example, on the MTEB benchmark, a text-embedding-3-large embedding can be shortened to a size of 256 while still outperforming an unshortened text-embedding-ada-002 embedding with a size of 1536.

	ada v2	text-embedding-3-small		text-embedding-3-large
Embedding size	1536	512	1536	256	1024	3072
Average MTEB score	61.0	61.6	62.3	62.0	64.1	64.6

This enables very flexible usage. For example, when using a vector data store that only supports embeddings up to 1024 dimensions long, developers can now still use our best embedding model text-embedding-3-large and specify a value of 1024 for the dimensions API parameter, which will shorten the embedding down from 3072 dimensions, trading off some accuracy in exchange for the smaller vector size.

Other new models and lower pricing

Updated GPT-3.5 Turbo model and lower pricing

Next week we are introducing a new GPT-3.5 Turbo model, gpt-3.5-turbo-0125, and for the third time in the past year, we will be decreasing prices on GPT-3.5 Turbo to help our customers scale. Input prices for the new model are reduced by 50% to $0.0005 /1K tokens and output prices are reduced by 25% to $0.0015 /1K tokens. This model will also have various improvements including higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.

Customers using the unpinned gpt-3.5-turbo model alias will be automatically upgraded from gpt-3.5-turbo-0613 to gpt-3.5-turbo-0125 two weeks after this model launches.

Updated GPT-4 Turbo preview

Over 70% of requests from GPT-4 API customers have transitioned to GPT-4 Turbo since its release, as developers take advantage of its updated knowledge cutoff, larger 128k context windows, and lower prices.

Today, we are releasing an updated GPT-4 Turbo preview model, gpt-4-0125-preview. This model completes tasks like code generation more thoroughly than the previous preview model and is intended to reduce cases of “laziness” where the model doesn’t complete a task. The new model also includes the fix for the bug impacting non-English UTF-8 generations.

For those who want to be automatically upgraded to new GPT-4 Turbo preview versions, we are also introducing a new gpt-4-turbo-preview model name alias, which will always point to our latest GPT-4 Turbo preview model.

We plan to launch GPT-4 Turbo with vision in general availability in the coming months.

Updated moderation model

The free Moderation API allows developers to identify potentially harmful text. As part of our ongoing safety work, we are releasing text-moderation-007, our most robust moderation model to-date. The text-moderation-latest and text-moderation-stable aliases have been updated to point to it. You can learn more about building safe AI systems through our safety best practices guide.

New ways to understand API usage and manage API keys

We are launching two platform improvements to give developers both more visibility into their usage and control over API keys.

First, developers can now assign permissions to API keys from the API keys page. For example, a key could be assigned read-only access to power an internal tracking dashboard, or restricted to only access certain endpoints.

Second, the usage dashboard and usage export function now expose metrics on an API key level after turning on tracking. This makes it simple to view usage on a per feature, team, product, or project level, simply by having separate API keys for each.

In the coming months, we plan to further improve the ability for developers to view their API usage and manage API keys, especially in larger organizations.

For the latest updates on OpenAI's APIs, follow us on X at @OpenAIDevs.

How OpenAI is approaching 2024 worldwide elections

OpenAI — Fri, 12 Jan 2024 21:01:35 GMT

Protecting the integrity of elections requires collaboration from every corner of the democratic process, and we want to make sure our technology is not used in a way that could undermine this process.

Our tools empower people to improve their daily lives and solve complex problems—from using AI to enhance state services to simplifying medical forms for patients.

We want to make sure that our AI systems are built, deployed, and used safely. Like any new technology, these tools come with benefits and challenges. They are also unprecedented, and we will keep evolving our approach as we learn more about how our tools are used.

As we prepare for elections in 2024 across the world’s largest democracies, our approach is to continue our platform safety work by elevating accurate voting information, enforcing measured policies, and improving transparency. We have a cross-functional effort dedicated to election work, bringing together expertise from our safety systems, threat intelligence, legal, engineering, and policy teams to quickly investigate and address potential abuse.

The following are key initiatives our teams are investing in to prepare for elections this year:

Preventing abuse

We expect and aim for people to use our tools safely and responsibly, and elections are no different. We work to anticipate and prevent relevant abuse—such as misleading “deepfakes”, scaled influence operations, or chatbots impersonating candidates. Prior to releasing new systems, we red team them, engage users and external partners for feedback, and build safety mitigations to reduce the potential for harm. For years, we’ve been iterating on tools to improve factual accuracy, reduce bias, and decline certain requests. These tools provide a strong foundation for our work around election integrity. For instance, DALL·E has guardrails to decline requests that ask for image generation of real people, including candidates.

We regularly refine our Usage Policies for ChatGPT and the API as we learn more about how people use or attempt to abuse our technology. A few to highlight for elections:

We’re still working to understand how effective our tools might be for personalized persuasion. Until we know more, we don’t allow people to build applications for political campaigning and lobbying.
People want to know and trust that they are interacting with a real person, business, or government. For that reason, we don’t allow builders to create chatbots that pretend to be real people (e.g., candidates) or institutions (e.g., local government).
We don’t allow applications that deter people from participation in democratic processes—for example, misrepresenting voting processes and qualifications (e.g., when, where, or who is eligible to vote) or that discourage voting (e.g., claiming a vote is meaningless).
With our new GPTs, users can report potential violations to us.

With our new GPTs, users can report potential violations to us.

Transparency around AI-generated content

Better transparency around image provenance—including the ability to detect which tools were used to produce an image—can empower voters to assess an image with trust and confidence in how it was made. We’re working on several provenance efforts. We implemented the Coalition for Content Provenance and Authenticity’s digital credentials—an approach that encodes details about the content’s provenance using cryptography—for images generated by DALL·E 3.

We are also experimenting with a provenance classifier, a new tool for detecting images generated by DALL·E. Our internal testing has shown promising early results, even where images have been subject to common types of modifications. We plan to soon make it available to our first group of testers—including journalists, platforms, and researchers—for feedback.

Finally, ChatGPT is increasingly integrating with existing sources of information—for example, users will start to get access to real-time news reporting globally, including attribution and links. Transparency around the origin of information and balance in news sources can help voters better assess information and decide for themselves what they can trust.

Improving access to authoritative voting information

In the United States, we are working with the National Association of Secretaries of State (NASS), the nation's oldest nonpartisan professional organization for public officials. ChatGPT will direct users to CanIVote.org, the authoritative website on US voting information, when asked certain procedural election related questions—for example, where to vote. Lessons from this work will inform our approach in other countries and regions.

We’ll have more to share in the coming months. We look forward to continuing to work with and learn from partners to anticipate and prevent potential abuse of our tools in the lead up to this year’s global elections.

OpenAI and journalism

OpenAI — Sun, 07 Jan 2024 00:04:41 GMT

Our goal is to develop AI tools that empower people to solve problems that are otherwise out of reach. People worldwide are already using our technology to improve their daily lives. Millions of developers and more than 92% of Fortune 500 are building on our products today.

While we disagree with the claims in The New York Times lawsuit, we view it as an opportunity to clarify our business, our intent, and how we build our technology. Our position can be summed up in these four points, which we flesh out below:

We collaborate with news organizations and are creating new opportunities
Training is fair use, but we provide an opt-out because it’s the right thing to do
“Regurgitation” is a rare bug that we are working to drive to zero
The New York Times is not telling the full story

1. We collaborate with news organizations and are creating new opportunities

We work hard in our technology design process to support news organizations. We’ve met with dozens, as well as leading industry organizations like the News/Media Alliance, to explore opportunities, discuss their concerns, and provide solutions. We aim to learn, educate, listen to feedback, and adapt.

Our goals are to support a healthy news ecosystem, be a good partner, and create mutually beneficial opportunities. With this in mind, we have pursued partnerships with news organizations to achieve these objectives:

Deploy our products to benefit and support reporters and editors, by assisting with time-consuming tasks like analyzing voluminous public records and translating stories.
Teach our AI models about the world by training on additional historical, non-publicly available content.
Display real-time content with attribution in ChatGPT, providing new ways for news publishers to connect with readers.

Our early partnerships with the Associated Press, Axel Springer, American Journalism Project and NYU offer a glimpse into our approach.

2. Training is fair use, but we provide an opt-out because it’s the right thing to do

Training AI models using publicly available internet materials is fair use, as supported by long-standing and widely accepted precedents. We view this principle as fair to creators, necessary for innovators, and critical for US competitiveness.

The principle that training AI models is permitted as a fair use is supported by a wide range of academics, library associations, civil society groups, startups, leading US companies, creators, authors, and others that recently submitted comments to the US Copyright Office. Other regions and countries, including the European Union, Japan, Singapore, and Israel also have laws that permit training models on copyrighted content—an advantage for AI innovation, advancement, and investment.

That being said, legal right is less important to us than being good citizens. We have led the AI industry in providing a simple opt-out process for publishers (which The New York Times adopted in August 2023) to prevent our tools from accessing their sites.

3. “Regurgitation” is a rare bug that we are working to drive to zero

Our models were designed and trained to learn concepts in order to apply them to new problems.

Memorization is a rare failure of the learning process that we are continually making progress on, but it’s more common when particular content appears more than once in training data, like if pieces of it appear on lots of different public websites. So we have measures in place to limit inadvertent memorization and prevent regurgitation in model outputs. We also expect our users to act responsibly; intentionally manipulating our models to regurgitate is not an appropriate use of our technology and is against our terms of use.

Just as humans obtain a broad education to learn how to solve new problems, we want our AI models to observe the range of the world’s information, including from every language, culture, and industry. Because models learn from the enormous aggregate of human knowledge, any one sector—including news—is a tiny slice of overall training data, and any single data source—including The New York Times—is not significant for the model’s intended learning.

4. The New York Times is not telling the full story

Our discussions with The New York Times had appeared to be progressing constructively through our last communication on December 19. The negotiations focused on a high-value partnership around real-time display with attribution in ChatGPT, in which The New York Times would gain a new way to connect with their existing and new readers, and our users would gain access to their reporting. We had explained to The New York Times that, like any single source, their content didn't meaningfully contribute to the training of our existing models and also wouldn't be sufficiently impactful for future training. Their lawsuit on December 27—which we learned about by reading The New York Times—came as a surprise and disappointment to us.

Along the way, they had mentioned seeing some regurgitation of their content but repeatedly refused to share any examples, despite our commitment to investigate and fix any issues. We’ve demonstrated how seriously we treat this as a priority, such as in July when we took down a ChatGPT feature immediately after we learned it could reproduce real-time content in unintended ways.

Interestingly, the regurgitations The New York Times induced appear to be from years-old articles that have proliferated on multiple third-party websites. It seems they intentionally manipulated prompts, often including lengthy excerpts of articles, in order to get our model to regurgitate. Even when using such prompts, our models don’t typically behave the way The New York Times insinuates, which suggests they either instructed the model to regurgitate or cherry-picked their examples from many attempts.

Despite their claims, this misuse is not typical or allowed user activity, and is not a substitute for The New York Times. Regardless, we are continually making our systems more resistant to adversarial attacks to regurgitate training data, and have already made much progress in our recent models.

We regard The New York Times’ lawsuit to be without merit. Still, we are hopeful for a constructive partnership with The New York Times and respect its long history, which includes reporting the first working neural network over 60 years ago and championing First Amendment freedoms.

We look forward to continued collaboration with news organizations, helping elevate their ability to produce quality journalism by realizing the transformative potential of AI.

Introducing ChatGPT Team

OpenAI — Sat, 06 Jan 2024 23:43:09 GMT

We launched ChatGPT Enterprise a few months ago and industry leaders like Block, Canva, Carlyle, The Estée Lauder Companies, PwC, and Zapier are already using it to redefine how their organizations operate. Today, we’re adding a new self-serve plan: ChatGPT Team.

ChatGPT Team offers access to our advanced models like GPT-4 and DALL·E 3, and tools like Advanced Data Analysis. It additionally includes a dedicated collaborative workspace for your team and admin tools for team management. As with ChatGPT Enterprise, you own and control your business data—we do not train on your business data or conversations, and our models don’t learn from your usage. More details on our data privacy practices can be found on our privacy page and Trust Portal.

ChatGPT Team includes:

Access to GPT-4 with 32K context window
Tools like DALL·E 3, GPT-4 with Vision, Browsing, Advanced Data Analysis—with higher message caps
No training on your business data or conversations
Secure workspace for your team
Create and share custom GPTs with your workspace
Admin console for workspace and team management
Early access to new features and improvements

Start now

Customize ChatGPT for any type of work

We recently announced GPTs—custom versions of ChatGPT that you can create for a specific purpose with instructions, expanded knowledge, and custom capabilities. These can be especially useful for businesses and teams. With GPTs, you can customize ChatGPT to your team’s specific needs and workflows (no code required) and publish them securely to your team’s workspace. GPTs can help with a wide range of tasks, such as assisting in project management, team onboarding, generating code, performing data analysis, securely taking action in your existing systems and tools, or creating collateral to match your brand tone and voice. Today, we announced the GPT Store where you can find useful and popular GPTs from your workspace.

Improve team efficiency and work quality

Integrating AI into everyday organizational workflows can make your team more productive. In a recent study by the Harvard Business School, employees at Boston Consulting Group who were given access to GPT-4 reported completing tasks 25% faster and achieved a 40% higher quality in their work as compared to their peers who did not have access.^{[^study]}

Connor O’Brien, VP of GTM Strategy & Operations at Sourcegraph, shares, "We use ChatGPT in almost every part of our business, from financial modeling for pricing and packaging to internal and external communications to board prep to recruiting and note taking—it’s accelerated everything we do allowing us to execute at a high level."

Dr. John Brownstein, Chief Innovation Officer at Boston Children’s Hospital says, “With ChatGPT Team, we’ve been able to pilot innovative GPTs that enhance our team’s productivity and collaboration. As we integrate GPTs safely and responsibly across internal operations, we know the transformative impact this will have in strengthening the systems that enable our doctors, researchers, students, and administrative staff to provide exceptional care to every patient that walks through our doors.”

ChatGPT Team costs $25/month per user when billed annually, or $30/month per user when billed monthly. You can explore the details or get started now by upgrading in your ChatGPT settings.

Explore ChatGPT Team Start now

Introducing the GPT Store

OpenAI — Thu, 04 Jan 2024 23:33:54 GMT

It’s been two months since we announced GPTs, and users have already created over 3 million custom versions of ChatGPT. Many builders have shared their GPTs for others to use. Today, we're starting to roll out the GPT Store to ChatGPT Plus, Team and Enterprise users so you can find useful and popular GPTs. Visit chat.openai.com/gpts to explore.

The store features a diverse range of GPTs developed by our partners and the community. Browse popular and trending GPTs on the community leaderboard, with categories like DALL·E, writing, research, programming, education, and lifestyle.

New featured GPTs every week

We will also highlight useful and impactful GPTs. Some of our first featured GPTs include:

Personalized trail recommendations from AllTrails
Search and synthesize results from 200M academic papers with Consensus
Expand your coding skills with Khan Academy’s Code Tutor
Design presentations or social posts with Canva
Find your next read with Books
Learn math and science anytime, anywhere with the CK-12 Flexi AI tutor

Include your GPT in the store

Building your own GPT is simple and doesn't require any coding skills.

If you’d like to share a GPT in the store, you’ll need to:

Save your GPT for Everyone (Anyone with a link will not be shown in the store).
Verify your Builder Profile (Settings → Builder profile → Enable your name or a verified website).

Please review our latest usage policies and GPT brand guidelines to ensure your GPT is compliant. To help ensure GPTs adhere to our policies, we've established a new review system in addition to the existing safety measures we've built into our products. The review process includes both human and automated review. Users are also able to report GPTs.

Builders can earn based on GPT usage

In Q1 we will launch a GPT builder revenue program. As a first step, US builders will be paid based on user engagement with their GPTs. We'll provide details on the criteria for payments as we get closer.

Team and Enterprise customers can manage GPTs

Today, we announced our new ChatGPT Team plan for teams of all sizes. Team customers have access to a private section of the GPT Store which includes GPTs securely published to your workspace. The GPT Store will be available soon for ChatGPT Enterprise customers and will include enhanced admin controls like choosing how internal-only GPTs are shared and which external GPTs may be used inside your business. Like all usage on ChatGPT Team and Enterprise, we do not use your conversations with GPTs to improve our models.

Explore GPTs at chat.openai.com/gpts.

Explore GPTs

Democratic inputs to AI grant program: lessons learned and implementation plans

Tyna Eloundou, Teddy Lee — Mon, 13 Nov 2023 23:44:18 GMT

As AI gets more advanced and widely used, it is essential to involve the public in deciding how AI should behave in order to better align our models to the values of humanity. In May, we announced the Democratic Inputs to AI grant program. We then awarded $100,000 to 10 teams out of nearly 1000 applicants to design, build, and test ideas that use democratic methods to decide the rules that govern AI systems. Throughout, the teams tackled challenges like recruiting diverse participants across the digital divide, producing a coherent output that represents diverse viewpoints, and designing processes with sufficient transparency to be trusted by the public.

At OpenAI, we’ll build on this momentum by designing an end-to-end process for collecting inputs from external stakeholders and using those inputs to train and shape the behavior of our models. We’re excited to combine our research with ideas and prototypes developed by the grant teams in the coming months.

In this update, we will cover:

How our grant recipients innovated on democratic technology
Key learnings from the grant program
Our implementation plans

How our grant recipients innovated on democratic technology

We received nearly 1,000 applications across 113 countries. There were far more than 10 qualified teams, but a joint committee of OpenAI employees and external experts in democratic governance selected the final 10 teams to span a set of diverse backgrounds and approaches: the chosen teams have members from 12 different countries and their expertise spans various fields, including law, journalism, peace-building, machine learning, and social science research.

During the program, teams received hands-on support and guidance. To facilitate collaboration, teams were encouraged to describe and document their processes in a structured way (via “process cards” and “run reports”). This enabled faster iteration and easier identification of opportunities to integrate with other teams’ prototypes. Additionally, OpenAI facilitated a special Demo Day in September for the teams to showcase their concepts to one another, OpenAI staff, and researchers from other AI labs and academia.

The projects spanned different aspects of participatory engagement, such as novel video deliberation interfaces, platforms for crowdsourced audits of AI models, mathematical formulations of representation guarantees, and approaches to map beliefs to dimensions that can be used to fine-tune model behavior. Notably, across nearly all projects, AI itself played a useful role as a part of the processes in the form of customized chat interfaces, voice-to-text transcription, data synthesis, and more.

Today, along with lessons learned, we share the code that teams created for this grant program, and present brief summaries of the work accomplished by each of the ten teams:

⚖️ Case Law for AI Policy

Report Contact

Creating a robust case repository around AI interaction scenarios that can be used to make case-law-inspired judgments through a process that democratically engages experts, laypeople, and key stakeholders.

💬 Collective Dialogues for Democratic Policy Development

Report Contact

Developing policies that reflect informed public will using collective dialogues to efficiently scale democratic deliberation and find areas of consensus.

🤝 Deliberation at Scale: Socially democratic inputs to AI

Report Contact

Enabling democratic deliberation in small group conversations conducted via AI-facilitated video calls.

🦉 Democratic Fine-Tuning

Report Website Contact

Eliciting values from participants in a chat dialogue in order to create a moral graph of values that can be used to fine-tune models.

⚡ Energize AI: Aligned - a Platform for Alignment

Report Website Contact

Developing guidelines for aligning AI models with live, large-scale participation and a 'community notes' algorithm.

👫 Generative Social Choice

Report Contact

Distilling a large number of free-text opinions into a concise slate that guarantees fair representation using mathematical arguments from social choice theory.

🌎 Inclusive.AI: Engaging Underserved Populations in Democratic Decision-Making on AI

Report Blog post Contact

Facilitating decision-making processes related to AI using a platform with decentralized governance mechanisms (e.g., a DAO) that empower underserved groups.

📰 Making AI Transparent and Accountable by Rappler

Report Contact

Enabling discussion and understanding of participants' views on complex, polarizing topics via linked offline and online processes.

🎨 Ubuntu-AI: A Platform for Equitable and Inclusive Model Training

Report Contact

Returning value to those who help create it while facilitating LLM development and ensuring more inclusive knowledge of African creative work.

🔁 vTaiwan and Chatham House: Bridging the Recursive Public

Report Website Contact

Using an adapted vTaiwan methodology to create a recursive, connected participatory process for AI.

Key learnings from the grant program so far

Public opinion can change frequently

Teams captured views in multiple ways. Many teams found that public views changed often.

The Democratic Fine-Tuning team created a chatbot that presented scenarios to participants and produced “value cards” that participants could review and evaluate. The Case Law team held expert workshops, and represented their opinions as a set of dimensions and considerations over a specific set of scenarios. The Inclusive.AI team captured both statements and how strongly people felt about these statements by allowing them to distribute voting tokens across many statements (versus a single vote). Many other teams presented statements accompanied by the proportion of participants in support.
Interestingly, many teams found that public opinion changed frequently, even day-to-day, which could have meaningful implications for how frequently input-collecting processes should take place. A collective process should be thorough enough to capture hard-to-change and perhaps more fundamental values, and simultaneously be sensitive enough (or recur frequently enough) to detect meaningful changes of views over time.

Bridging across the digital divide is still difficult and this can skew results

Reaching relevant participants across digital and cultural divides might require additional investments in better outreach and better tooling.

Some teams found that participants recruited online leaned more optimistic toward AI, a trait that was correlated with increased support and enthusiasm for AI model behavior in general.
Furthermore, due to lack of reach or availability on most platforms we consulted, most teams faced serious difficulty in recruiting participants across the digital divide.
More subtly, even when citizens of global majority countries are included, the tools might be less useful to them due to limitations in understanding the local language or context. For example, in their online and onground focus group discussions, the Rappler team found that the documented disparities in performance across languages of readily available speech recognition tools like Whisper made transcription difficult in participants’ spoken languages, e.g. Tagalog, Binisaya, Hiligaynon, which are major Filipino languages.
The Ubuntu-AI team chose to directly incentivize participation, by developing a platform that allows African creatives to receive compensation for contributing to machine learning about their own designs and backgrounds.

Finding agreement within polarized groups

Finding a compromise can be hard when a small group has strong opinions on a particular issue.

The Collective Dialogues team found that each session always contained a small group of people who felt strongly that restricting AI assistants from answering certain questions was wrong no matter what. In this case, because the group was small, majority voting yielded outcomes that they strongly disagreed with.
The Collective Dialogues, Energize.AI, and Recursive Public teams’ processes were designed to find policy proposals that would be strongly supported across polarized groups. For example, all policy guidelines generated by the Collective Dialogues process with U.S. participants —including on vaccine information, a known divisive issue— had over 72% support across Democrats, Independents, and Republicans.

Reaching consensus vs. representing diversity

When trying to produce a single outcome or make a single decision to represent a group, there might be tension between trying to reach consensus and adequately representing the diversity of various opinions. It’s not just about siding with the majority, but also giving a platform to different viewpoints.

The Generative Social Choice team devised a method that highlights a few key positions, showcasing the range of opinions while finding some common ground. They used mathematical theory to help navigate this balance.
Meanwhile, the Inclusive.AI team investigated different voting mechanisms and how they are perceived. They found that methods which show how strongly people feel about their choices, and that ensure everyone has an equal say, are perceived as more democratic and fair.

Hopes and anxieties about the future of AI governance

Some participants felt nervous about the use of AI in writing policy and would like transparency regarding when and how AI is applied in democratic processes. Post-deliberation sessions, many teams found that participants became more hopeful about the public's ability to help guide AI.

In collaborations with a municipal government and roundtables with various stakeholders, both the Deliberation at Scale and Recursive Public teams found that while there is clear interest in the role AI itself might play in improving democratic processes, there is also an air of caution around how much power or influence democratic institutions should grant to these systems and their developers.
The Collective Dialogues team found that combining AI in a decision making process with non-AI decision steps – like expert curation of AI-generated policy clauses, or a final vote on a policy informed by AI – resulted in a process that had AI-enabled efficiency while still being perceived as trustworthy and legitimate by the public.
In the Collective Dialogues team’s process, a popular clause emerged during deliberations – across different participant groups – which roughly states that the chosen policy should be “expanded on and updated regularly as new issues arise, better understanding is developed, and AI's capabilities evolve.” On average, this clause was supported by 85% of participants.
The Collective Dialogues and Deliberation at Scale teams found that the act of participating in a deliberation session on an AI policy issue made people more likely to think the public was capable of helping guide AI behavior in general.

Our implementation plans

Our goal is to design systems that incorporate public inputs to steer powerful AI models while addressing the above challenges. To help ensure that we continue to make progress on this research, we are forming a “Collective Alignment” team consisting of researchers and engineers that will:

Implement a system for collecting and encoding public input on model behavior into our systems.
Continue to work with external advisors and grant teams, including running pilots to incorporate the grant prototypes into steering our models.

We are recruiting exceptional research engineers from diverse technical backgrounds to help build this work with us. If you’re excited about what we’re doing and would like to apply, please apply to join us!