• IHawkMike@lemmy.world
      link
      fedilink
      arrow-up
      33
      ·
      edit-2
      2 months ago

      You’re not missing much. A few modern file types are zips with expected folder structures, especially MSOffice files. But this is nowhere near universally true.

      You can open a file in your text editor of choice and if you see it start with PK (for Phil Katz the creator of the format and the original PKZIP/PKUNZIP programs) then it’s probably a zip.

      Also, by the logic of the OP, all DLLs are EXEs.

    • cron@feddit.org
      link
      fedilink
      arrow-up
      21
      ·
      2 months ago

      OP refers to the fact that you can rename some filetypes to .zip and unpack them.

      Notable examples microsoft office files (.docx) or android apps (.apk).

      Counterexample are media files (mp3, mp4, jpg).

      • mumblerfish@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        2 months ago

        OP refers to the fact that you can rename some filetypes to .zip and unpack them.

        So… you mean the zip program just rename them back? Why?

        • cron@feddit.org
          link
          fedilink
          arrow-up
          14
          ·
          2 months ago

          I think it makes sense from a programming view. When you have a document, you can add all the media files and pack them together as one archive. Then the program sets the filename to .docx so everyone knows that they need an office program to open that file.

          For the users, all you need to know is what program can open which files. If every document would be named .zip, you would have no idea if it was a spreadsheet or slides for your presentation.

          • mumblerfish@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            2 months ago

            I got that from the other answers. I was just very confused why I’d have to rename them to “.zip”.

            I still don’t get why it is “most” files.

            • cron@feddit.org
              link
              fedilink
              arrow-up
              7
              ·
              2 months ago

              I don’t think “most” applies here. Text-based files, pdf, media files and most executeable files are not .zip.

    • thevoidzero@lemmy.world
      link
      fedilink
      arrow-up
      11
      ·
      2 months ago

      There are basically two types of files. Text files and binary files.

      Most information are stored in text files so humans can easily understand it, and it’s easier to find errors, review, parse. But text storage takes more space than binary files. And many complicated softwares normally need multiple text files or data files, many of them just store them together as a zip file so that it’s easier to handle. Examples are .docx,.pptx, etc files in MS Office, try unzipping them and see what they contain. Zipping also has advantages of reducing file sizes.

    • mumblerfish@lemmy.world
      link
      fedilink
      arrow-up
      4
      ·
      2 months ago

      OK, thanks for all the answers. I get it, a “docx” is a zip archive expected to contain something specific making it a docx. But why “most” though?

      • Acamon@lemmy.world
        link
        fedilink
        arrow-up
        3
        ·
        2 months ago

        I think ‘most’ is hyperbole for dramatic effect / increased engagement. “more files than you might think are actually following the zip file structure” isn’t as punchy.

        • Interstellar_1OP
          link
          fedilink
          arrow-up
          3
          ·
          2 months ago

          I just didn’t think of too many file extensions when I had this thought. I was also thinking of more obscure file extensions, and not the main media formats.