OpenAI has introduced a new security feature called Lockdown Mode for ChatGPT, which disables web access and other functions to protect users from potential data theft. The feature blocks all internet and external service connections, preventing sensitive data from being leaked during conversations with the AI. Users can activate Lockdown Mode in the security settings and temporarily disable it for individual conversations when broader functionality is needed. This mode is designed to safeguard users handling sensitive data by restricting features that connect ChatGPT to the internet or external services.

Lockdown Mode limits live web search to cached content, which may be outdated or unavailable. It fully disables Deep Research and Agent Mode, and ChatGPT can no longer download files or display web images in regular responses. Network access for Canvas-generated code is also blocked. OpenAI acknowledges that prompt injection remains an unsolved problem, calling it a 'frontier, challenging research problem' they are working to solve. Despite these measures, the mode only blocks the final step in an exfiltration chain, the attempt to send data to an attacker via network requests. OpenAI's FAQ states that prompt injection 'is not currently a major risk,' but the impact 'could grow as attackers develop more sophisticated methods.'

Lockdown Mode is available for personal accounts and self-managed ChatGPT Business accounts under 'Settings > Security.' In managed workspaces, admins can set up the mode through role-based access controls (RBAC) for individual members or groups. Users can temporarily turn off Lockdown Mode for specific chats when they need full functionality for a particular conversation. Lockdown Mode and Developer Mode are mutually exclusive. For apps and connectors, OpenAI distinguishes by account type, allowing personal account users to use connectors that access already-synced data while blocking live access, write actions, and finance and shopping features. In managed workspaces, OpenAI recommends admins only enable trusted apps and evaluate each one's exfiltration risk individually.

Source: thedecoder