word import and Reimport config

jwangpeng
1-Newbie

Hi everyone:

     I want to import a Word document into Integrity, but after the import finished I found that the description text under some captions had been changed into child-level captions.

How can I update the config to fix this?


example:

Word content before import:

     1. title one

      this is the first title.

     1.1 xxxx

     2.title two

     this is the second title.


after import:

     1. title one

      1.1 this is the first title.

      1.2 xxxx

     2.title two

      2.1this is the second title.

2 REPLIES
mrump
14-Alexandrite
(To:jwangpeng)

This is a tricky one:

1. First, some basics about Word:

Your sample (which looks like 3 headings and 2 paragraphs of text)

1. title one

      this is the first title.

     1.1 xxxx

     2.title two

     this is the second title.


technically consists of 5 paragraphs in Word. You can verify that by saving the document as HTML. A heading in Word is a paragraph like any other, only with a different style applied.


2. Your Integrity import XSL seems to be working on paragraph level. That explains the output:

The Gateway import basically goes through the HTML version of the Word document from top to bottom and "converts" each paragraph, based on its style, into a different category of item.

Each item represents one paragraph, just like in Word (only that now the inner structure is obvious).
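
To illustrate the paragraph-level behaviour described above, here is a minimal Python sketch. It is not the Gateway's XSL and not a PTC API; the (style, text) pairs, the function name and the "Heading"/"Content" categories are assumptions made up for this example.

```python
# Hypothetical model of the Word sample: one (style, text) pair per paragraph.
paragraphs = [
    ("Heading 1", "title one"),
    ("Normal", "this is the first title."),
    ("Heading 2", "xxxx"),
    ("Heading 1", "title two"),
    ("Normal", "this is the second title."),
]


def convert_per_paragraph(paragraphs):
    """Turn every paragraph into its own item, categorised only by its style."""
    items = []
    for style, text in paragraphs:
        category = "Heading" if style.startswith("Heading") else "Content"
        items.append({"category": category, "text": text})
    return items


items = convert_per_paragraph(paragraphs)
for item in items:
    print(item["category"], "-", item["text"])
```

Because every paragraph becomes a separate item, the two body paragraphs end up as items of their own below their headings, which would match the numbered child entries you see after the import.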


If you want to keep the "structure" seen in Word, you will have to change the entire parser.

e.g.

Go through the document from top to bottom and concatenate all the text you find between two headings into one item.

Each item must contain two text fields: the potential heading text and the content text.


The result would be 3 items in Integrity:

Section    - Heading Text        - Content Text

1.         - title one           - this is the first title.

1.1        - xxxx                - <empty>

2.         - title two           - this is the second title.
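
The grouping idea above could be sketched roughly like this in Python. Again, this is only an illustration under assumptions: the real parser is an XSL inside the Gateway configuration, and the (style, text) input, the function name and the dictionary keys are hypothetical.

```python
# Hypothetical model of the Word sample: one (style, text) pair per paragraph.
paragraphs = [
    ("Heading 1", "title one"),
    ("Normal", "this is the first title."),
    ("Heading 2", "xxxx"),
    ("Heading 1", "title two"),
    ("Normal", "this is the second title."),
]


def group_by_heading(paragraphs):
    """Create one item per heading and append the following text to it."""
    items = []
    counters = []  # current section number for each heading level
    for style, text in paragraphs:
        if style.startswith("Heading "):
            level = int(style.split()[1])
            # Forget counters of deeper levels, then make sure this level exists.
            counters = counters[:level]
            while len(counters) < level:
                counters.append(0)
            counters[level - 1] += 1
            section = ".".join(str(n) for n in counters)
            items.append({"section": section, "heading": text, "content": ""})
        elif items:
            # Body text: concatenate onto the most recent heading's item.
            separator = "\n" if items[-1]["content"] else ""
            items[-1]["content"] += separator + text
        # Text before the first heading is ignored in this sketch.
    return items


items = group_by_heading(paragraphs)
for item in items:
    print(item["section"], "-", item["heading"], "-", item["content"] or "<empty>")
```

For the sample this gives three items, with sections 1, 1.1 and 2, and an empty content field for the 1.1 item.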

I would recommend contacting PTC for help with that parser, or just accepting what you already have.

Hi Matthias Rump:

     Thanks for your reply. The last result you provided is what I want. In fact, the final working sketch is shown in the following picture.

Can you help me to configure it correctly?

2.png
