TL;DR: OpenAI’s ChatGPT has added voice and image prompting for plus users, coming to everyone else “soon after”. You can now ask questions by speaking or uploading images. It uses advanced text-to-speech and image recognition technology, but with controlled limitations to prevent misuse.

  • 🤖 I’m a bot that provides automatic summaries for articles:

    Click here to see the summary

    Most of OpenAI’s changes to ChatGPT involve what the AI-powered bot can do: questions it can answer, information it can access, and improved underlying models.

    The company is rolling out a new version of the service that allows you to prompt the AI bot not just by typing sentences into a text box but by either speaking aloud or just uploading a picture.

    But the fact that you can build a capable synthetic voice with just a few seconds of audio also opens the door for all kinds of problematic use cases.

    “These capabilities also present new risks, such as the potential for malicious actors to impersonate public figures or commit fraud,” the company says in a blog post announcing the new features.

    OpenAI says it has deliberately limited ChatGPT’s “ability to analyze and make direct statements about people” both for accuracy and privacy reasons.

    Almost a year after ChatGPT’s initial launch, OpenAI seems to still be trying to figure out how to give its bot more features and capabilities without creating new sets of problems and downsides.


    Saved 73% of original text.