DeepSeek’s Safety Guardrails Fails Every Test (Wired.com)

DeepSeek has potholes too! Is this schadenfreude? Is this expected? What do we learn from this?

https://www.wired.com/story/deepseeks-ai-jailbreak-prompt-injection-attacks/

DeepSeek is all anyone is talking about in AI. I consider the “great leap forward” from DeepSeek a “flash-in-the-pan.” We knew that algorithms would improve in the AI space. Not everyone is blabbing online about what they’re doing. The tools will improve, the tech will improve, in fits and starts.

As the quote goes: We overestimate what technology can do in the short term, and we underestimate it in the long term. A version of this quote is attributed to Bill Gates, but it has been said a number of times going back to the 1960’s.

It is true here as well.

At the same time, I am thankful for researchers, skeptics and naysayers who test the tech and are willing to point out the potholes in our progress.

Yes, there are potholes. Yes, there is progress. Yes, there are more unexpected revelations ahead. The question is, how might we best prepare ourselves for these surprises?

  • Read broadly
  • Take cybersecurity precautions
  • Don’t be the first to download and test the latest and greatest thing unless you have a high tolerance for failure and unexpected outcomes (including being hacked)
  • Watch your industry and other industries for what they’re doing and what ideas they have
  • Build and keep strong relationships with colleagues doing similar work
  • Keep your eyes open

Be ready for sudden acceleration in unexpected directions and unexpected times.

If we use the Psycho 80 (my invented term) as a guide, we remember that 80% of success in information technology is about psycho-social-political interpersonal skill and 20% is about the technology.

Author: CT Lin

CMIO, UCHealth (Colorado); Professor, University of Colorado School of Medicine

Leave a Reply

Discover more from The Undiscovered Country

Subscribe now to keep reading and get access to the full archive.

Continue reading