c++compilationlibmagic

Compiling libmagic statically (c/c++ file type detection)


Thanks to the guys that helped me with my previous question (linked just for reference).

I can place the files fileTypeTest.cpp, libmagic.a, and magic in a directory, and I can compile with g++ -lmagic fileTypeTest.cpp fileTypeTest. Later, I'll be testing to see if it runs in Windows compiled with MinGW.

I'm planning on using libmagic in a small GUI application, and I'd like to compile it statically for distribution. My problem is that libmagic seems to require the external file, magic. (I'm actually using my own shortened and compiled version, magic_short.mgc, but I digress.)

A hacky solution would be to code the file into the application, creating (and deleting) the external file as needed. How can I avoid this?

added for clarity:

magic is a text file that describes properties of different filetypes. When asked to identify a file, libmagic searches through magic. There is a compiled version, magic.mgc that works faster. My application only needs to identify a handful of filetypes before deciding what to do with them, so I'll be using my own magic_short file to create magic_short.mgc.


Solution

  • This is tricky, I suppose you could do it this way... by the way, I have downloaded the libmagic source and looking at it...

    There's a function in there called magic_read_entries within the minifile.c (this is the pure vanilla source that I downloaded from sourceforge where it is reading from the external file.

    You could append the magic file (which is found in the /etc directory) to the end of the library code, like this cat magic >> libmagic.a. In my system, magic is 474443 bytes, libmagic.a is 38588 bytes.

    In the magic.c file, you would need to change the magichandle_t* magic_init(unsigned flags) function, at the end of the function, add the line magic_read_entries and modify the function itself to read at the offset of the library itself to pull in the data, treat it as a pointer to pointer to char's (char **) and use that instead of reading from the file. Since you know where the offset is to the library data for reading, that should not be difficult.

    Now the function magic_read_entries will no longer be used, as it is not going to be read from a file anymore. The function `magichandle_t* magic_init(unsigned flags)' will take care of loading the entries and you should be ok there.

    If you need further help, let me know,

    Edit: I have used the old 'libmagic' from sourceforge.net and here is what I did:

    1. Extracted the downloaded archive into my home directory, ungzipping/untarring the archive will create a folder called libmagic.
    2. Create a folder within libmagic and call it Test
    3. Copy the original magic.c and minifile.c into Test
    4. Using the enclosed diff output highlighting the difference, apply it onto the magic.c source.
    48a49,51
    > #define MAGIC_DATA_OFFSET     0x971C
    > #define MAGIC_STAT_LIB_NAME "libmagic.a"
    >
    125a129,130
    >       /* magic_read_entries is obsolete... */
    >       magic_read_entries(mh, MAGIC_STAT_LIB_NAME);
    251c256,262
    <
    ---
    >
    >       if (!fseek(fp, MAGIC_DATA_OFFSET, SEEK_SET)){
    >               if (ftell(fp) != MAGIC_DATA_OFFSET) return 0;
    >       }else{
    >               return 0;
    >       }
    >
    
    40c40
    <       magic_read_entries(mh,"magic");
    ---
    >       /*magic_read_entries(mh,"magic");*/
    
    

    It should work then. If not, you will need to adjust the offset into the library for reading by modifying the MAGIC_DATA_OFFSET. If you wish, I can stick up the magic data file into pastebin. Let me know.

    Hope this helps, Best regards, Tom.