ByteDance, TikTok, and OpenAI's LLM
ByteDance, the parent company of TikTok, came into conflict with OpenAI and Microsoft over its use of GPT-generated data. Reports emerged that ByteDance was using OpenAI's technology, accessed through Microsoft Azure, to develop its own large language model (LLM), codenamed Project Seed. According to those reports, ByteDance used OpenAI's API extensively at several stages of development, including training and model evaluation, and internal documents indicated that the company relied on OpenAI's technology during the early phases of Project Seed.
In response to these revelations, OpenAI suspended ByteDance's account. Niko Felix, an OpenAI spokesperson, stated that all API customers must adhere to OpenAI's usage policies and described the suspension as a precautionary measure while OpenAI investigated the matter further. According to Felix, ByteDance's use of the API was minimal, but if the investigation found that this use violated the policies, ByteDance would need to make the necessary changes or face termination of its account.
In the company's defense, ByteDance spokesperson Jodi Seth said that GPT-generated data was used for annotation early in Project Seed's development but was removed from the training data around mid-2023. Seth also said that ByteDance is licensed by Microsoft to use the GPT APIs, primarily to power products and features in non-China markets, while in China it relies on its self-developed model to support the China-exclusive Doubao chatbot platform.
The episode highlighted ByteDance's struggle to keep pace in the generative AI race despite its earlier leadership in AI. The company's discreet use of OpenAI's technology to build a rival LLM ran counter to OpenAI's terms of service, which prohibit using output from its models to develop competing AI models. The controversy also raised the question of whether Microsoft would follow OpenAI's lead and suspend ByteDance's access to Azure services, though Microsoft had not responded to the allegations at the time of the reports.
The incident underscores the competitive and complex nature of AI development, as well as the ethical and legal challenges that companies face in leveraging existing technologies to build new products. It also highlights the importance of adhering to licensing agreements and usage policies in the rapidly evolving field of AI.