Problems configuring OE Replication

Posted by James Palmer on 31-Oct-2016 11:28

Progress 10.2B08 on Windows servers

Trying to get OE Replication running on a client site. Normally have few issues but struggling today for some reason. 

Source and Target have identical repl.properties files. 

[server]
    control-agents=l_idx41_pctrl
    database=pctrl
    defer-agent-startup=10080
    transition=manual
    transition-timeout=600
    agent-shutdown-action=recovery
    
[control-agent.l_idx41_pctrl]
   name=l_idx41_pctrl
   database=pctrl
   host=cmlsql03
   port=38140
   connect-timeout=86400
   replication-method=async
   critical=0

[agent]
    name=l_idx41_pctrl
    database=pctrl
    connect-timeout=86400
    listener-maxport=4500
    listener-minport=4387

[transition]
    database-role=reverse
    restart-after-transition=0
    auto-begin-ai=1
    transition-to-agents=l_idx41_pctrl

When I Restart Server and check status it is 1100 on source, 3104 on target. The port 38140 is the port the database is started on on the Target DB. That's correct is it not? 

Any ideas? 

All Replies

Posted by Libor Laubacher on 31-Oct-2016 12:15

38140 is indeed the -S port of/for the target db.

What does the pctrl.lg file say on the source side after the dsrutil pctrl -C restart server ?

1100 is connecting ....

It's not the properties, I used yours above for test and

2016/10/31@18:12:49.372+0100] P-14772      T-15748 I RPLS    6: (11251) The Replication Server successfully connected to all of it's configured Agents.

[2016/10/31@18:12:49.372+0100] P-14772      T-15748 I RPLS    6: (10436) The source database pctrl and the target database D:\repltest2\secondary\pctrl on host localhost are synchronized.

Posted by James Palmer on 31-Oct-2016 12:29

Looking for what you're asking for and found the following which I didn't find earlier:

[2016/10/31@16:26:54.816+0000] P-14908      T-15416 I RPLS   18: (-----) Connection timeout for host cmlsql03 port 38140 transport TCP.

[2016/10/31@16:27:44.818+0000] P-14908      T-15416 I RPLS   18: (-----) Connection timeout for host cmlsql03 port 38140 transport TCP.

[2016/10/31@16:27:52.819+0000] P-14908      T-15416 I RPLS   18: (10396) The Fathom Replication Server cannot connect to the database broker on cmlsql03 at the port -27396.

[2016/10/31@16:27:52.819+0000] P-14908      T-15416 I RPLS   18: (10397) The connection attempt to the Fathom Replication Agent l_idx41_pctrl failed.

[2016/10/31@16:27:52.819+0000] P-14908      T-15416 I RPLS   18: (11241) The Replication Server will continue to attempt Agent connections until Mon Nov 07 16:21:12 2016.

[2016/10/31@16:28:14.838+0000] P-6496       T-7964  I SRV    10: (742)   Login usernum 90, userid PSLAdmin4 client type ABL , on CMLSQL01 using TCP/IP IPV4 address 127.0.0.1.

We have 9 databases in the application and 10.2B suffers from that horrible 'feature' where database connect/disconnect messages are written for all databases in the session to each db log file, so there's a lot of noise. Easy to miss stuff! :(

Posted by Libor Laubacher on 01-Nov-2016 05:40

This would suggest that either target database is not started on 38140, or the agent is not running (I believe you said it was), or there is a firewall between source and target.

Posted by James Palmer on 01-Nov-2016 06:56

Thanks Libor. Fairly sure it's firewall related. Got to wait for the datacentre to think about investigating now.

Posted by James Palmer on 01-Nov-2016 12:30

Rather than create a new thread, it seems the datacentre hosting the client's servers is having a bit of a nightmare.

They tried to open the ports up on the firewall for this and ended up breaking connectivity for the application.

So, they want to have a definitive list of what ports need to be open in order for the application to work. Is it just the ports that the databases are running on, or are there others?

Posted by Libor Laubacher on 01-Nov-2016 18:58

How can opening up the ports break the connectivity ? :-D

I don't know what does your app need port wise - aside of the client server that is.

For the replication itself between the source and target it would be:

port=38140

range from listener-minport=4387 to listener-maxport=4500 (you can tighten this up to like 2 ports range per DB set, if you are replicating more than 1 set of databases)

For the client-server connectivity it would be -S and range from (database) minport to (database) maxport.

if you have appserver/webspeed etc - there is more.

Posted by James Palmer on 02-Nov-2016 04:55

Thanks Libor. I suspect that they changed from a blacklist to a whitelist or something. They were meant to be opening up the connectivity between Source and Target, but broke the connectivity between the Citrix boxes and the source in the process. No idea what they did. Bit useless though.

The annoying thing is we get it in the neck. The datacentre never makes mistakes apparently.

This thread is closed