interopc#-2.0office-2003

How to extract bullet information from word document?


I want to extract information of bullets present in word document. I want something like this : Suppose the text below, is in word document :

Steps to Start car :

Then I want my text file like below :

Steps to Start car :

<BULET> Open door </BULET>

<BULET> Sit inside </BULET>

<BULET> Close the door </BULET>

<BULET> Insert key </BULET>

<BULET> etc.</BULET>

I am using C# language to do this.

I can extract paragraphs from word document and directly write them in text file with some formatting information like whether text is bold or is in italics, etc. but dont know how to extract this bullet information.

Can anyone please tell me how to do this?

Thanks in advance


Solution

  • I got the answer.....

    First I was converting doc on paragraph basis. But instead of that if we process doc file sentence by sentence basis, it is possible to determine whether that sentence contains bullet or any kind of shape or if that sentence is part of table. So once we get this information, then we can convert that sentence appropriately. If someone needs source code, I can share it.